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Optimal Shift Scheduling with 
Multiple Break Windows 



Turgut Ay kin 
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y his paper presents a new integer programming model for optimal shift scheduling with 
X multiple rest and lunch breaks, and break windows. A set-covering approach for this prob- 
em was originally developed by Dantzig (1954). Since then, a number of set-covering-based 
formulations have been proposed in the literature. These formulations require an integer variable 
for every shift type, shift start time, and rest/lunch break placement combination. Unfortunately 
the number of integer variables required is rather large, making them impractical to solve for 
an optimal solution in most applications. We present a new approach in which a set of break 
variables is introduced for every shift-break type combination to determine the break place- 
ments. This approach leads to a significantly improved integer programming model requiring 
substantially smaller number of variables and computer memory. We tested the proposed ap- 
proach with 40 test problems involving between 1,728 and 8,640 shift variations, and five de- 
mand patterns. Our results showed that the proposed formulation is very useful in solving laree 
shift scheduling problems optimally. 

(Manpower Scheduling; Service Operations Management; Integer Programming) 



1. Introduction 

The shift scheduling problem arises in a variety of 
service organizations such as telephone companies, 
airlines, hospitals, police departments, and banks 
and involves scheduling of manpower to meet de- 
mand that changes over the course of an operating 
day. Employees are assigned to various shifts spec- 
ified by shift type, length (e.g., four-hour part-time, 
nine-hour full-time), shift start time, and the number 
and length of relief/lunch breaks. To provide flexi- 
bility in scheduling relief and lunch breaks, break 
windows (time intervals within which employees 
must start and complete their breaks) are often spec- 
ified. Thus, the general shift scheduling problem in- 
volves determining the number of employees to be 
assigned to each shift and specifying the'timing of 
their relief and lunch breaks. It is known that the 
flexibility provided in shift lengths, shift start times, 
and break placements lowers the total staffing cost 

0025-1 909/96/4204/0591 $01. 25 
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and the number of employees needed (Showalter 
and Mabert 1988). 

Since its introduction by Edie (1954), this problem 
and a number of related ones (e.g., the tour scheduling 
problem, the generalized employee scheduling prob- 
lem) have attracted considerable attention in the litera- 
ture. Both heuristic and integer programming based ap- 
proaches have been proposed. The integer program- 
ming based approaches traditionally have been based 
on the following set-covering formulation originally 
suggested by Dantzig (1954). 

minimize £ c k X k/ (1) 

Jtt= K 

subject to Y a kt X k > b t for all / G T, 

kCK 

X k > 0 and integer, (2) 

where K is the set of all shifts, T is the set of Zplanning 
periods that the shift schedule covers, b, is the number 

Management Science/ Vol. 42, No. 4, April 1996 591 



AYKIN 

Optimal Shift Schaiulm^ 



of employees needed in period t, c k is the cost of assign- 
ing an employee to shift k, a kt is equal to one if period / 
is a work period for shift k and zero otherwise, and X ; 
is an integer variable defined as the number of employ- 
ees assigned to shift k, k € K. 

Given different shift types, lengths, shift start times, 
number and length of relief (rest) and lunch breaks and 
the time windows for these breaks, the set-covering for- 
mulation requires that all possible combinations of 
these features be included as different shifts in K. When 
relief and lunch breaks and break windows are in- 
cluded, the number of shifts in K increases rapidly. Con- 
sequently, standard integer programming tools fail to 
be useful. The difficulties caused by problem size are 
extensively discussed in the literature. These difficulties 
and the expected benefits from the added scheduling 
flexibility provided by break windows led researchers 
to study heuristic approaches to this problem. Segal 
(1974), for instance, considered a telephone operator 
scheduling problem in which, depending on the shift 
length, each shift may include up to two 15-minute re- 
lief and one 30-minute lunch break. Break windows are 
specified as 1.5 hours long so that each 15-minute relief 
break may start at six different rimes (assuming 15-min- 
ute planning periods). He proposed a network flow 
based heuristic for solving the shift assignment and the 
break placement problems separately in three phases. 
Keith (1979) considered the operator shift scheduling 
problem with lunch, early and late relief breaks at Illi- 
nois Bell Telephone Company. He proposed a slightly 
different set-covering model by ignoring break sched- 
uling flexibility and assuming predetermined relief and 
lunch break times. The heuristic solution procedure 
proposed first solves the linear programming relaxation 
of the integer model and develops a feasible shift sched- 
ule using a rounding heuristic. Break windows are ex- 
amined in the second part in an attempt to improve the 
shift schedule found. Keith (1979) reported that all Bell 
System Operating Companies were using a scheduling 
system based on either Keith (1979) or Segal's (1974) 
models. The shift scheduling problem with multiple re- 
lief and lunch breaks and break windows was also stud- 
ied by Henderson and Berry (1976). They reported that, 
in their case, the number of different shift variations (in 
K) was 7120, and noted that there may be as many as 
15,000 shift variations (Klasskin 1973) in more general 



cases. They proposed n number of heuristics to solve 
this problem in two phases. Other heuristic approaches 
for this problem are found in Buff a et al. (1976), Glover 
et al. (1984), Taylor and Huxley (1989), Thompson 
(1990), and Brusco and Jacobs (1 993), among others. Be- 
sides these efforts, a number of researchers studied sim- 
pler cases involving either only one lunch break win- 
dow (three start times for a lunch break, Bailey and 
Fields 1985) or fixed lunch break placement (one hour 
lunch break between the fourth and the fifth hours of 
work, Brusco and Jacobs 1993) while not including relief 
breaks. 

While the solution aspect of the general shift sched- 
uling problem has been attracting much interest, far 
less attention has been devoted to modelling options. 
Most studies used Dantzig's set-covering formulation 
or an extension thereof. Modelling a special case of 
the problem involving shifts with a single break has 
been considered by several researchers. Moondra 
(1976) considered a full-time shift with a lunch win- 
dow allowing two break start times and part-time 
shifts working between four to eight hours without a 
lunch break. He represented the length of the part- 
time shifts implicitly and proposed a formulation as- 
suming that 50% of the employees assigned to the full 
time shift will take their lunch break in the first break 
period (in the lunch window) and the remaining 50% 
in the second break period. More recently, Bechtold 
and Jacobs (1990) described an integer programming 
formulation using a set of integer variables to model 
break assignments. A variable of this type represents 
the total number of employees on all shifts starting 
their break in a particular planning period. In the 
problem they considered, employees are allowed a 
single (lunch) break. Their approach further assumes 
that the duration of the break is identical for all shifts 
and that every shift contains only a single break win- 
dow consisting of contiguous planning periods. 
Moreover, the approach proposed in Bechtold and 
Jacobs (1990) is limited to cases involving less-than- 
24-hour operations and may fail to schedule breaks 
within their respective break windows in cases in- 
volving "extraordinary break-window overlaps" (de- 
scribed as a situation in which the break window for 
a shift is included entirely in the break window of 
another shift, and the larger break window starts at 
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least one planning period earlier and ends at least 
one planning period later than the smaller break 
window). 

The number and the duration of breaks an em- 
ployee takes are determined by many factors includ- 
ing legal restrictions, company policies, and union 
agreements. Typically, an employee working seven to 
nine consecutive hours a day receives one lunch break 
and two rest breaks, one before and one after the 
lunch. Shifts with shorter work span may be assigned 
fewer breaks. The length of a lunch break is usually a 
half hour to an hour and a rest break 15 to 30 minutes. 
Further, in some systems (e.g., telephone operator 
centers), employees may be assigned to split shifts 
consisting of two work periods separated by a break 
lasting two to four hours. A rest break is also pro- 
vided in each work period. Thus, in many applica- 
tions, the problem involves multiple breaks of un- 
equal duration and multiple disjoint break windows. 
In this paper, we present a new integer programming 
formulation for this problem. To model the schedul- 
ing flexibility provided by the break windows, we in- 
troduce integer variables for the numbers of employ- 
ees assigned to a shift and starting their breaks in dif- 
ferent planning periods within the associated break 
windows. Thus, the break variables are associated 
with shifts and their values determine the break 
placements. Gaballa and Pearce (1979) considered a 
telephone sales system operating less than 24 hours 
assuming that the employees receive only one lunch 
break and no relief breaks. They proposed an integer 
programming formulation in which the scheduling 
flexibility provided by the lunch windows is mod- 
elled implicitly with additional integer variables. The 
main disadvantage of their model was that it required 
more integer variables and constraints than the equiv- 
alent set-covering formulation in the case they con- 
sidered. Our formulation is also based on the implicit 
representation of the break placements. We consider 
the problem with multiple rest and lunch breaks and 
multiple break windows, and introduce different sets 
of break variables for different shift-break tvpe com- 
binations. More importantly, we show that/although 
not beneficial in the case considered by Gaballa and 
Pearce (1979), this approach leads to a significantly 
improved integer programming model requiring a 



substantially smaller number of variables than the 
equivalent set-covering model in the general shift 
scheduling problem involving multiple breaks and 
break windows. The number of nonzeros and the den- 
sity of the /1-matrix are also reduced significantly as 
compared to the set-covering formulation. In fact' the 
longer the break windows, the more efficient (com- 
pared to the set-covering approach) the new formu- 
lation becomes. 

The proposed formulation is not restricted to a par- 
ticular type of shift scheduling problem. Shifts may in- 
volve multiple rest and lunch breaks and multiple dis- 
joint break windows. Breaks and break windows asso- 
ciated with different shifts may be different in number 
and length, and there may be break window overlaps. 
Like the set-covering formulation, the proposed ap- 
proach is applicable to problems involving 24-hour con- 
tinuous operations as well as less than 24-hour opera- 
tions. 

In the computational results section, we report our 
experience with test problems involving between 1,728 
and 8,640 different shift variations using the integer pro- 
gramming option of LINDO, a general purpose ILP 
code. The largest problem we solved with the proposed 
approach using LINDO involved 32,928 shift variations 
(Aykin 1995). These represent, to the best of our knowl- 
edge, the largest shift scheduling problems solved to 
optimality in the literature. Although the size of the 
set-covering model suggests that the application of 
column-generation-based techniques may be helpful 
(for a column-generation-based heuristic for the tour 
scheduling problem, see Easton and Rossin 1991), this 
is yet to be studied. In contrast, the proposed approach 
provides a new model enabling the use of standard in- 
teger programming tools. 

The remainder of the paper is organized as fol- 
lows. In the next section, we discuss an example il- 
lustrating the proposed approach, define the nota- 
tion and present the new formulation for the general 
shift scheduling problem. Computational results ob- 
tained with 40 test problems are presented in §3. In 
this section, we also discuss and present, for the first 
time, an optimal solution to a case reported in the 
literature by Segal (1974). Finally, §4 summarizes 
our results and discusses a number of research di- 
rections. 
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2. Problem Formulation 

2.1. An Illustrative Example 

To illustrate our approach, we consider a simple case 
involving only two shifts, K - (1, 21, with a nine-hour 
shift span (defined as the duration of a shift including 
breaks). Employees are given one half-hour lunch break 
and two 15-minute relief breaks, leaving a work span of 
eight hours for each shift. Shift 1 starts at 7:00 and shift 
2 starts at 9:00, and the system provides service from 
7:00 to 18:00. Thus, there are 44 15-minute intervals in 
a work day, t = 1, . . . , 44. Assume that the ideal start 
time for a lunch break is in the middle of the work span; 
that is, after four hours of work. The ideal relief break 
start times are in the middle of the four-hour work in- 
tervals; that is, two hours after the start of a shift and 
two hours after the completion of a lunch break starting 
at the ideal lunch time. Assume that all break windows 
are 1.5 hours long and starting half an hour before the 
ideal break start times. Then the time window for the 
first 15-minute relief break of shift 1 is from 8:30 to 10:00, 
for the lunch break from 10:45 to 12:15, and for the sec- 
ond 15-minute relief break from 13:15 to 14:45. Thus, 
the lunch break for an employee working this shift may 
be scheduled in five different ways: from 10:45 to 11:15, 
11:00 to 11:30, 11:15 to 11:45, 11:30 to 12:00, and 11:45 to 
12:15. And each 15-minute relief break may be sched- 
uled in six different ways. The shifts, ideal break start 
times and the break windows are shown in Figure 1. 

Let X, and X 2 be the numbers of employees as- 
signed to shifts 1 and 2, respectively. Let U kl be the 
number of employees working shift k and taking their 
first 15-minute relief break in period t. The employees 
assigned to shift 1 may take their first relief breaks in 
periods t = 7, 8, 9, 10, 11, 12 (that is 8:30, 8:45, 9:00, 
9:15, 9:30, and 9:45), and the employees assigned to 
shift 2 in periods t = 15, 16, 17, 18, 19, 20. Similarly, 
define W kt as the number of employees working shift 
k and starting their lunch break in period /. The em- 
ployees assigned to shift 1 may start their lunch 
breaks in periods t = 16, 17, 18, 19, 20, and the em- 
ployees assigned to shift 2 in periods t = 24, 25, 26, 
27, 28. Define V kt as the number of employees working 
shift k and taking their second relief break in period 
f. The employees working shift 1 may take their sec- 
ond relief breaks in periods / = 26, 27, 28, 29, 30, 31, 



Figure 1 The Shifts, Ideal Break Times, and Break Windows in the 
Two-Shift Example 
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and the employees working shift 2 in periods / = 34, 
35, 36, 37, 38, 39. The break variables U kt , W kt , and V kt 
are defined only for break start times in the associated 
break windows. The objective function can be taken 
as to minimize total cost. 

minimize c x X } 4- c 2 X 2 , (3) 

where c x and c 2 are the costs of assigning an employee 
to shifts 1 and 2, respectively. 

If period t is in the shift span of k, then the number 
of employees assigned to shift k and working (i.e., not 
on break) in period t is given by 

X k , if t is not a break start time in any 

of the break windows, 
X k — U kt , if t is a first relief 

break start time, 
Xk - Wia, if t is and (t - 1) is not a 
y f _ I lunch break start time, 

kt X k - VV, (/ _ n - W kt , if both (t - l)and t (4) 
are lunch break start times, 
Xt: "* Wku-u, if (f - 1) is and / is not a 

lunch break start time, 
X k - V kt , if t is a second relief 

break start time. 

Let b t be the number of employees needed in period 
t and let a kt be one if period f is in the shift span (a work 
or a break period) of shift k and zero otherwise. Then, 
the demand constraint for period t can be stated as 
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/»,. 



(5) 



The demand constraint for period 20, for instance, is 
X, - 1<V, |M - IV, y) + X. - U Z2 , > } hVf {6) 

since period 20 is within two possible Junch breaks for 
shift 1 and within the first relief break window of shift 2. 

In order to schedule two relief breaks and one lunch 
break for every employee, we include one constraint for 
every break window as: 



X, - 



U kf = 0, k G K; 



: in the 1st Kolkt 
Brcik Window t>t £ 



(7) 



x * - I W kt = 0, A s K; (8) 

/ in tho Luiu-h 
Brwifc Windnw t>t fr 



X, 



/ in tin- 2nd Relict' 
Brt\ik Window uf A- 



v*, - o, k e k. 



(9) 



Although with these break constraints, sufficient 
numbers of relief and lunch breaks are scheduled for all 
employees, their timing for an employee working a spe- 
cific shift is not determined. The assignment of employ- 
ees to specific combinations of scheduled breaks does 
not affect the total number of employees needed to meet 
the demand during a work day. The set-covering model, 
on the other hand, determines the number of employees 
assigned to every possible rest/lunch break placement 
combination explicitly and thus requires an excessive 
number of variables. 

Employees working a shift can be assigned to specific 
relief and lunch break times specified with the break 
variables in a variety of ways. One approach may in- 
volve assigning employees to earliest first relief, lunch, 
and second relief break times available (the earliest 
break time rule). To illustrate, we solved the two-shift 
example described above using the part of the demand 
data given in Segal (1974) for the 44 15-minute planning 
periods between 7:00 and 18:00 (for an 11-hour work 
day covered by the two shifts). The optimal solution is 
shown in Table 1. In this solution, 21 employees are 
assigned to shift 1 and 87 employees to shift 2. The 
scheduled first relief, lunch and second relief break 
times (with the planning period index t within paren- 
theses) together with the number of employees who will 



be on break at those times are also shown in Table 1. 
Using these break times and the earliest break time rule, 
the employees working shifts J and 2 can b* assigned 
to specific break time combinations as shown in Table 
2. Assigning employees to break combinations usually 
requires only a fraction of a CPU second and can even 
be done manually. 

2.2. Integer Programming Model 

To facilitate the presentation of the model, without los- 
ing generality we assume that every employee receives 
two relief breaks and one lunch break. Further, it is as- 
sumed that the duration of the relief breaks is one plan- 
ning period and the duration of the lunch break is two 
planning periods. Let X, be the number of employees 
assigned to shift k G K, where K is the set of all shifts 
including all shift types /lengths, and allowed shift start 
times. Let U kl , W kt and V kt be the numbers of employees 
assigned to shift k and starting their first relief break, 
lunch break, and second relief break in period t , respec- 
tively. Relief and lunch breaks must start and be com- 
pleted within the specified time windows. Let Bl k , BL k , 
and B2 k be the sets of planning periods in which an em- 
ployee working shift k may start his/her first relief 
break, lunch break, and second relief break, respec- 
tively. 

The variables U m , W kt2 , and V m are defined for it 
G K and for fl G Bl,, t2 G BL k , and f3 G B2 k , respec- 
tively. Breaks with duration longer than two planning 
periods (e.g., a two- to four-hour idle period allowed 
with split shifts) can be modelled similar to a lunch 
break using W kt . In this case, however, even- break vari- 
able W kt is subtracted from the associated shift variable 
in (5) for the duration of the break (e.g., for a four- 
hour break, subtracted from X k in demand constraints 
for the 16 15-minute planning periods covered by the 
break). Shifts with fewer breaks (e.g., part-time shifts) 
can be modelled easily by excluding the variables and 
the constraint(s) associated with the unauthorized 
break(s). Shifts with more than three breaks can be in- 
cluded by introducing a new set of break variables and 
a new break constraint for even' additional break. Let 
Tl„ TL, and T2, be the sets of shifts for which period t 
is a break start time within the time windows for the 
first relief break, lunch break, and second relief break, 
respectively. Let <?., be one if period t is in the shift span 
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Tabie 1 Optimal Break Start Times for the Second Shift Example 



Number of 
Employees 



Break Times 



sm M Assigned 1st Relief Break U„ Lunch Break W kt 2nd Relief Break V- 



21 



87 



9:15-9:30 (10) 



10:30-10:45 (15) 
10:45-11:00 (16) 
11:00-11:15 (17) 
11:15-11:30(18) 
11:30-11:45 (19) 



10:45-11:15 (16) 
11:45-12:15 (20) 



12:45-13:15 (24) 
13:00-13:30 (25) 
13:15-13:45 (26) 
13:45-14:15 (28) 



14:15-14:30 (30) 



15:15-15:30 (34) 
15:30-15:45 (35) 
15:45-16:00 (36) 
16:00-16:15 (37) 
16:15-16:30 (38) 
16:30-16:45 (39) 



Number of 
Employees 
on Break 



21 
2 
19 
21 
19 
11 
18 
20 
19 
23 
6 
33 
25 
19 
22 
22 
6 
6 

12 



of shift k and zero otherwise. Then, the general shift 
scheduling problem can be formulated as follows: 

minimize £ ^X*, (10) 

subject to 
1 a kl X k - £ U kt - X Wtv-u 

- 1 W kt - X Vkt s= b t for all t G T, (11) 

keTL t keT2, 

Xk- I U kt = 0 for all k e K, (12) 

teB\ k . 

Xa " I W kt = 0 for all k E K, (13) 

teBLi 

x <- I V kt = 0 forall* eK, (14) 

f€B2 t 

Xk, Ukt, Wu, V kt 0 and integer. 

The proposed formulation can be extended to model 
various restrictions on break placements, shift types, 
and objectives. The following constraint, for instance, 



may be added to limit the number of employees taking 
a (relief or lunch) break in period f. 

(15) 

where h t is the maximum number of employees who 
may be on break in period t. This constraint may be 
imposed if the number of employees taking a break in 
a period has to be limited for such reasons as space 
availability (e.g., cafeteria availability), fire regulations, 
or company policy. 

The number of break variables U kt , W kt , and V kt de- 
pends on the length of the break windows. The number 
of break constraints (12)-(14) is usually between 0 and 
2 for a part-time shift, 3 for a full-time shift (for two 
relief and one lunch breaks), and 3 or 4 for a longer or 
a split shift. Thus the number of break constraints 
needed in the proposed formulation is determined by 
the number and types of shifts considered. Let the car- 
dinality of set K be | K | = n K . Also, let \Bl k \ = « u , \BL k \ 
= «uv I B2 k | = n^, and | T | = n T . The number of vari- 
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Table 2 Break Schedule for the Employees in the Second Shift 
Example 



Break limes 












Number of 


Shift 


1st Relief 




2nd Relief 


Employees 


Break 


Lunch Break 


Break 


on Break 


1 




1 Ci-A c 1 1 -1 C 
1 .1 D 


14:15-14:30 


2 


1 


9:15-9:30 


11:45-12:15 


14:15-14:30 


19 


2 


10:30-10:45 


12:45-13:15 


15:15-15:30 


19 


2 


10:45-11:00 


12:45-13:15 


15:30-15:45 


4 


2 


10:45-11:00 


13:00-13:30 


15:30-15:45 


6 


2 


10:45-11:00 


13:15-13:45 


15:30-15:45 


1 


2 


11:00-11:15 


13:15-13:45 


15:30-15:45 


11 


2 


11:00-11:15 


13:15-13:45 


15:45-16:00 


7 


2 


11:15-11:30 


13:15-13:45 


15:45-16:00 


14 


2 


11:15-11:30 


13:45-14:15 


15:45-16:00 


1 


2 


11:15-11:30 


13:45-14:15 


16:00-16:15 


5 


2 


11:30-11:45 


13:45-14:15 


16:00-16:15 


1 


2 


11:30-11:45 


13:45-14:15 


16:15-16:30 


6 


2 


11:30-11:45 


13:45-14:15 


16:30-16:45 


12 



ables needed for a shift k E K in the proposed formu- 
lation is (1 + n xk + n u . + » 2A .) and in the set-covering 
formulation is (n xk n lJk n 2k ) (assuming » u ., Mjut , n 2A . * 0). 
The proposed formulation requires a total of \n K + 

+ "a + n a )l integer variables and \3n K + n T | con- 
straints (assuming three breaks for every shift in K), 
while the set-covering model requires {2 k€K (n lk n u .n 2k )\ 
integer variables for all possible shift and break place- 
ment combinations and n r constraints. 

Note that different shift types do not have to be mod- 
elled following the same approach in the proposed for- 
mulation. If a shift is given only one break with a break 
window, enumeration of all shift variations would re- 
sult in one less variable and no break constraint. Simi- 
larly, if the break times for a shift are predetermined 
then n xk = n Lk = n 2k = 1, and there is no need for break 
variables and constraints for this shift. The number of 
variables and constraints in the proposed model can be 
further reduced by using one of the Equations (12), (13), 
or (14) to replace X, with a set of break variables. This' 
approach, although it reduces the number of variables 
and constraints needed, increases the number of non- 
zeros and the density of the /1-matrix. Our experience 
with this reduced formulation did not show any im- 



provement in the computational effort needed to obtain 
an optimal schedule (Avkin 1995). 

2.3. Upper Bounds 

One can obtain upper bounds for decision variables X, 
from the demand constraint (11). Since X k > U kt , X } 
^ W k! and X«. V kl , an upper bound on X, also provides 
upper bounds on the break variables. If the employees 
working a shift, say k € K t are not gl ven anv break but 

work SL, periods, say periods t = 1 SU, without 

loss of generality, then X, < where fr max = maxffo,, 
■ . . , fts-uK Thus, if X k = b mAX , then employees working 
shift k will be sufficient to meet demand during periods 
t = 1, . . . , SL /; . If the employees are given relief and 
lunch breaks, more employees may be needed during 
the break windows to meet demand and schedule these 
breaks. Suppose the employees assigned to shift k are 
given a one-period relief break with a break window 
Bl k . In a given period within the break window t E B\ k , 
(frma* - b t ) employees are free to take a break. Thus, if 
SfHFu- (b max - b t ) > b max , then a relief break can be sched- 
uled for every employee while meeting demand. Other- 
wise, let (x) be the smallest integer greater than or equal 
to x. Then, 

~ I (i» nwx - j (n ik - l)^ 

more employees will be needed during the break win- 
dow in order for every employee to take his/her break 
and to meet demand. If this is the only break the em- 
ployees assigned to shift k are given, then an upper 
bound for X* is given by 



UBh = 



'W if I Ow - b : ) - / W , 



t£Rlk 



(>hk - 1) ) } otherwise. (16) 



If the employees assigned to shift k are given a break 
lasting two or more periods (e.g., lunch break, idle pe- 
riod for a split shift), an upper bound on XV can be ob- 
tained in a slightly different fashion. We illustrate this 
for a lunch break consisting of two consecutive periods 
with a break window BL k . With b inax employees working 
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shift k, (b mAy - b t ) employees are free to start their lunch 
break in period / G BL : . But since a break of this type 
lasts two periods, these employees can start their lunch 
breaks in period t if at least an equal number of em- 
ployees are also free in period (t + 1) and if (/ + 1) 
€ BL : . Hence, the maximum number of lunch breaks 
that can be scheduled within the lunch window with 
b mAX employees is determined by examining the number 
of employees that are not needed in adjacent periods 
within that window. Let the maximum number of lunch 
breaks that can be scheduled with fr inav employees be 
then a lunch break can be scheduled 
for every employee. Otherwise, 

<(*W - L m , x )/([(n L , + l)/2] - 1)) 

more employees will be needed during the lunch win- 
dow in order for every employee to take his/her lunch 
break. If this is the only break the employees assigned 
to this shift take, then an upper bound on X k is given 
by 

b ma .x + |<(iw x - L mo J/([(n u + l)/2] - 1))) 

otherwise, 
(17) 

where [x] is the largest integer not exceeding x. 

If a break window permits only one break start time, 
then all employees should start their break at that time. 
Consequently, if the demand in that period is not zero, 
the above approach would not provide an upper bound. 

When shift k includes multiple relief and lunch 
breaks, the highest of the bounds found for these breaks 
using (16), and (17) is taken as the upper bound for X k . 
Upper bounds calculated this way were included in the 
proposed formulation as simple bounds. During our 
preliminary tests, these upper bounds have been found 
to be effective in reducing the computational effort 
needed to find an optimal schedule. 

3. Computational Results 

The computational experiments were conducted in two 
parts. In the first part, we studied the effects of break 
scheduling flexibility by analyzing a case reported in the 
literature by Segal (1974). Our results concerning this 



case are presented in §3.2. In the second part, we studied 
the effectiveness of the proposed model bv solving 40 
test problems using the integer programming option of 
LIN DO. The results obtained with the test problems are 
presented in §3.3. The operating conditions assumed in 
our experiments are discussed in §3.1. 

3.1. Scheduling Environment 

The scheduling environment chosen for this study in- 
volves a continuous 24-hour work day (cyclical prob- 
lem). Employee requirements are determined for 96 
15-minute planning periods indexed successively T 
= U, . . . , 96). Duration of a shift is taken as nine hours 
including breaks. Shifts are assumed to start on the hour 
or on the half hour, resulting in 48 shifts in a work day, 
K = {1, . . . , 48). Each employee is given one half-hour 
lunch break and two 15-minute relief breaks. Like in 
Segal (1974), we specify the break windows for each 
shift with reference to ideal break times as follows: the 
ideal start time for the first relief break is specified as 
two hours after the start of a shift, the ideal lunch break 
start time is located in the middle of the work day after 
four hours of work, and the ideal second relief break 
time is specified as after six hours of work. The break 
windows are formed by setting the earliest break start 
times half an hour before the ideal break times. The 
length of break windows is varied to create problems of 
varying size. The eight relief and lunch break window 
combinations considered are shown in Table 3. The re- 
lief break windows considered allow three to six break 
start times (45 minutes to 1.5 hour break windows) and 
the lunch break windows four to six lunch start times 
(1 hour and 15 minute to 1 hour and 45 minute lunch 
windows). In our computational experiments, we con- 
sidered five different demand patterns with the eight 
break window combinations, resulting in 40 test prob- 
lems. Three of the demand patterns were found in the 
literature and are either obtained directly from or re- 
sembling real patterns from service systems: (i) the tri- 
modal demand profile for telephone operators de- 
scribed in Segal (1974) with a minimum quarter hour 
demand of 2 and a maximum of 89 operators, (ii) the 
bimodal demand profile with a minimum quarter hour 
demand of 2 and a maximum of 9 given in Henderson 
and Berry (1976) for the General Telephone and Elec- 
tronics Company of California, and (iii) the unimodal 
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Table 3 



Set Covering versus Proposed Formulation: Problem Characteristics and Memory Requirements 



2 
3 
4 
5 
6 
7 
8 



Number of Variables 



Break 
Window 
Combination n u = rh 



Set 
Covering 



Proposed 
Model 



3 
3 
4 

4 

5 

5 

5 

6 



4 
5 
4 
5 

4 

5 

6 

5 



1.728 
2.160 
3.072 
3.840 
4.800 
6.000 
7,200 
8.640 



528 
576 
624 
672 
720 
768 
816 
864 



Reduction 



69.5 
73.3 
79.7 
82.5 
85.0 
87.2 
88.7 
90.0 



Number of Constraints 



Number of Nonpros in A-Marri* 



Set 
Covering 



96 
96 
96 
96 
96 
96 
96 
96 



; K'on:crc>j r 
A- Main/ 



Proposed 
Model 



Set 
Covering 



Proposed 
Model 



Reduction 



Set 
Coverin 



240 
240 
240 
240 
240 
240 
240 
240 



55.296 
69.120 
98.30^ 
122.880 
1 53.600 
192.000 
230,400 
276.480 



3.024 
3.168 
3.216 
3,360 
3.408 
3.552 
3,696 
3.744 



94.5 
95.4 
96.7 
97.3 
97.8 
98.2 
98.4 
93.6 



33.3 
33.3 
33.3 
33.3 
33.3 
33.3 
33.3 
33.3 



Proposed 
Model 



2.4 
2.3 
2.1 
2.1 
2.0 
1.9 
1.9 
1.8 



demand profile with a minimum quarter hour demand 
of 2 and a maximum of 27 given in Buffa et ah (1976) 
for the General Telephone Company of California. In 
addition to these demand patterns, we considered two 
synthetically generated demand profiles: a bimodal and 
a trimodal sinusoidal demand pattern. Minimum and 
maximum quarter hour demand across these two de- 
mand patterns were 5 and 50, respectively. 

3.2. Effects of Break Scheduling Flexibility 

A number of researchers have demonstrated the effects 
of break scheduling flexibility in shift scheduling (see, 
for example, Bailey and Fields 1985). These studies,' 
however, have remained limited to the cases involving 
shifts with a single break. To illustrate the effects of 
break scheduling flexibility in problems involving mul- 
tiple breaks and break windows, we considered the de- 
mand profile given in Segal (1974) and varied the size 
of the break windows from one break start time (i.e., 
"fixed" break time) to six break start times. We first 
assumed the scheduling environment described in §3.1. 
and solved this problem repeatedly while changing the 
number of break start times. Figure 2A shows the op- 
timal number of employees needed with different lunch 
window sizes when the relief break windows allow one 
or five relief break start times. As can be seen, compared 
to fixed relief break rimes, scheduling flexibility pro- 
vided by five relief break start times lowers the number 
of employees needed significantly, between 6 to 14 em- 
ployees or 4.1% to 8.9%, for all lunch window sizes. 
Figure 2B shows a similar pattern: for a given number 
of lunch break start times, the number of employees 



needed decreases as the numbers of relief break start 
times n lk and n 2k increase. Moreover, Figures 2 A and 
2B show that when the employees are given multiple 
breaks, scheduling flexibility provided by the time 
windows specified for these breaks together affect the 
number of employees needed in the optimal schedule. 
In this case, when all three break times are fixed, 158 
employees are needed. But, when the largest break 
windows are allowed (e.g., n xk - n 2k = 6 and n U: = 5), 
the number of employees needed drops to 141 or 
10.8% less. 

Noting the importance of relief and lunch break 
placements in shift scheduling, Segal (1974) considered 
a set-covering formulation and developed a network 
flow based heuristic for this problem. He also discussed 
a case involving the demand pattern considered above 
and a number of four-, six-, seven-, eight- and nine-hour 
shifts. In this case, regular shifts lasting seven hours or 
longer included one half-hour lunch break and two 15- 
minute relief breaks while the shifts lasting 6 hours in- 
cluded only one half-hour lunch break and part-time 
shifts lasting 4 hours only one 15-minute relief break. 
All breaks had a 1.5-hour break window. The cost of 
assigning an employee to a six-, seven-, eight- or nine- 
hour shift was set to 2 and to a four-hour part-time shift 
to 1 (Segal 1993). He reported that the best solution 
found with his network flow based heuristic in this case 
had a total cost of 320 (only the total cost was reported 
for this solution). In this problem, the set-covering for- 
mulation requires 1478 integer variables and 96 con- 
straints, while the proposed formulation requires only 
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Figure 2 Number of Employees versus Break Window Sizes 

(A) Number of Lunch Stan Times (r? u; ) with n,, = /;.. = 1, 5 

(B) Number of Relief Break Start Times [n u ] with n ti = 1. 5 
162 • 



160 
158 
156 
154 
152 
150 

14* 

146 
144 
142 
140 
138 



Fixed break 
times 



n1k = 1 



n1k = 5 



Optimal 



(A) 



162 
160 
158 
J 156 
i 154 
I 152 

01 

8 150 



2 4 6 

number of lunch start times (nLk) 



10 



12 



Fixed break 
times 



146 
144 
142 
140 
138 



nLk=1 
-+ ^ ( nLk = 5 



Optimal 



(B) 



2 4 6, 

number of relief break start times (n1k) 



10 



12 



282 integer variables and 135 constraints. We solved this 
case with the proposed formulation and obtained an 
optimal schedule with a total cost of 298. Thus the cost 
of the schedule reported in Segal (1974) is 7.4% higher 
than optimal. 

33. Experimental Results with the Test Problems 

The number of variables, number of constraints, num- 
ber of nonzero coefficients, and the density of the A- 
matrix for the set-covering formulation and the pro- 
posed model in the test problems considered are shown 
in Table 3. Compared to the set covering formulation, 
the proposed model requires substantially smaller, be- 



tween 69.5V; to 90% less, number or* variables for all 
break window combinations. This reduction in the 
number of variables is achieved at the expense of three 
break constraints per shift (one per break). Net effect on 
the memory needed in terms of the number of nonzero 
coefficients in the y4-matrix is even more remarkable— 
between 94.5% to 98.6% less than that for the equivalent 
set covering model. Further, note that the longer the 
break windows, the more efficient (compared to the set 
covering formulation) the proposed formulation be- 
comes. In the largest case considered (break window 
combination 8 with n lk = n 2k = 6 and n Lk = 5), the set- 
covering model requires memory for 276,480 nonzero 
coefficients in the ,4-matrix, whereas the proposed 
model for only 3,744. Moreover, the density (percent of 
nonzeros) of the ,4-matrix for the proposed model is 
also substantially lower than the density of the A-matrix 
for the set-covering model: between 1.8% to 2.4% com- 
pared to 33.3% for the set covering formulation. Also 
note that as the break windows provide more flexibility 
with more break start times, the number of variables 
and the number of nonzeros in the proposed model in- 
crease at a much slower rate than for the set covering 
model. The density of the A-matrix, on the other hand, 
decreases. In summary, compared to the set-covering 
formulation, the proposed model for the general shift 
scheduling problem requires substantially smaller num- 
ber of variables, significantly less memory for nonzero 
coefficients, and has a very low density. 

The test problems were formulated using both the 
proposed formulation and the set-covering approach. 
All problems were solved using the integer program- 
ming option of LINDO with limits 20,003 columns, 
14,900 integer variables, 6,503 rows, and 120,000 non- 
zero coefficients running in a batch mode on a VAX8550 
computer. 

Our test runs showed that nonzero break variables 
Uk:, W kt , and V k . tend to be integer when the associated 
shift variables X* have integer values. Therefore, a 
branching strategy giving higher priority to shift vari- 
ables X k was implemented in LINDO. We also included 
the upper bounds calculated using (16) and (17) as sim- 
ple bounds on the decision variables in LINDO. The 
CPU time limit was set to 750 seconds. No attempt was 
made to determine the branching requirements and to- 
tal solution time if LINDO could not locate an integer 
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Table 4 Problem Generation, LP Solution and Integer Programming Solution Times (CPU Wins) 



Break 
Window 
Combination n tt = n-, k 





Average 




Integer Programming Solution 




Problem 


Average LP 










Generation 


Solution 


Min 


Average 


Max 


4 


0.01 


0.43 


1.05 


2.0S 


4.55 


5 


0.01 


0.60 


0.85 


6.27 


19.34 


4 


0.01 


0.48 


1.16 


2.72 


4.14 


5 


0.01 


0.71 


1.42 


5.32 


10.08 


4 


0.01 


0.35 


0.38 


4.64 


15.58 


5 


0.02 


0.48 


0.00 


5.C3 


8.64 


6 


0.02 


0.63 


0.70 


8.32 


17.08 


5 


0.02 


0.54 


1.09 


3.78 


6.63 



1 3 

2 3 

3 4 

4 4 

5 5 

6 5 

7 5 

8 6 



optimal solution to a problem within the CPU time 
limit. Instead, when this happened, a second attempt 
was made choosing variables for branching according 
to the total demand covered by the associated shifts. In 
this attempt, the shift variable X k associated with a shift 
covering higher total demand was assigned a higher 
branching priority. In case of ties, the shift variable with 
the earliest start time was chosen for branching. If a sec- 
ond attempt was needed, the total time for the two at- 
tempts is reported. Thus, the CPU time limit for two 
attempts was 25 minutes. 

The memory needed for the set covering formulation 
exceeded the memory limit of 120,000 nonzeros in all 
but ten cases involving the first two break window com- 
binations. As seen in Table 3, the problems involving 
these break window combinations are the smallest con- 
sidered in this study. Although the problems with break 
window combination 3 with n ]k = n, J: = 4 and n U: = 4 
were inputted successfully into LINDO, the memory re- 
quired consistently exceeded the memory limit while 
solving these problems and the runs had to be termi- 
nated. Out of the 10 test problems (with n lk = n2k = 3 
and n Lk = 4, and n u . = n 2k = 3 and n Lk = 5) that could 
be inputted, only three were solved within the CPU time 
limit in the first attempt. One more case was solved in 
the second attempt; the other six cases remain unsolved. 

The average problem generation times including in- 
put to LINDO and average LP solution times in CPU 
minutes for the proposed formulation are shown in Ta- 
ble 4. The problem generation and input times were the 
same for the problems involving the same break win- 
dow combination. The minimum, average, and maxi- 



mum integer programming solution times in CPU 
minutes for the proposed formulation are also shown 
in Table 4. With the proposed model, out of 40 prob- 
lems, an optimal schedule was obtained in 35 problems 
in the first attempt and in four more in the second at- 
tempt. In only one problem involving n u = n 2k = 4 and 
n Lk = 5 an optimal schedule was not located within the 
allowed time. In the four cases solved with the set- 
covering formulation, solution times were between 4 
and 14 times higher than the corresponding solution 
times with the proposed model. Finally, as seen in Table 
4, with the proposed formulation, the integer program- 
ming solution time is not increasing as the number of 
break start times (break window sizes) increases. This 
is also consistent with our observation concerning the 
values of the break variables; the break variables U kl , 
W kt , and V kf , whose numbers for a shift are determined 
by the numbers of break start times, tend to be integer 
when the associated shift variables X k have integer 
values. 



4. Extensions and Conclusions 

In this paper, we presented a new formulation for the 
general shift scheduling problem with multiple 
breaks and break windows. Compared to the well- 
known set-covering formulation, the proposed model 
substantially reduces the number of variables needed 
at the expense of one additional constraint for each 
break-shift combination. Like the set-covering for- 
mulation, the proposed model is applicable to prob- 
lems involving both 24-hour continuous operations 
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(cyclical problem) and less than 24-hour operations 
(acyclical problem), part-time, full-time, and split 
shifts involving multiple rest and lunch breaks, and 
multiple disjoint break windows. 

We have discussed a number of practical considera- 
tions such as limits on the number of employees taking 
lunch and /or relief breaks in a planning period, and 
split shifts. An important extension of the proposed ap- 
proach would involve its application to the tour sched- 
uling problem (Aykin 1994). The tour scheduling prob- 
lem involves both shift scheduling (including break 
scheduling for every shift) for the work days in a week 
and days-off scheduling. Another area in which more 
work needs to be done is the development and testing 
of specialized algorithms for the proposed formulation. 1 

1 The author is grateful to Professor Awi Federgruen, the associate 
editor, and the reviewers for their constructive suggestions. 
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Abstract 

The labor shift scheduling problem has traditionally been formulated using the set covering approach proposed by 
Dantzig, G.B. [1954. Operations Research 2 (3), 339-341]. The size of the resulting integer model with this approach, 
however, has been found to be very large to solve optimally in most practical applications. Recently, Bechtold, S.E., 
Jacobs, L.W. [1990. Management Science 36 (11), 1339-1351] proposed a new integer programming formulation re- 
quiring significantly smaller number of variables and nonzeros in the /i-matrix than the equivalent set covering for- 
mulation. However, due to its assumptions, this approach has remained limited to the special cases of the shift 
scheduling problem. In Aykin, T. [1996. Management Science 42, 591-603], we presented another implicit modeling 
approach that is applicable to the general problem without any of the limitations of Bechtold and Jacobs (loc. cit.). This 
approach also requires substantially smaller number of variables and nonzeros in the A -matrix than the equivalent set 
covering formulation. In this paper, we relax the assumptions of Bechtold and Jacobs (loc. cit.) and present a gener- 
alized version of it. The extended formulation of Bechtold and Jacobs, although requiring smaller number of variables, 
has more constraints, more nonzeros in the ,4 -matrix, and significantly higher density than the formulation of Aykin 
(loc. cit.). We compare these modeling approaches through solving (optimally) 220 problems involving as many as 
32 928 shift variations. Our computational results show that the time needed to locate an optimal shift schedule and the 
percentage of the problems solved optimally with Aykin's formulation are substantially better than those with the 
formulation of Bechtold and Jacobs. © 2000 Elsevier Science B.V. All rights reserved. 

Keywords: Labor shift scheduling; Manpower management; Service operations management: Implicit modeling; Integer 
programming 
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1. Introduction 

One of the critical tasks the manager of a ser- 
vice system faces daily is the effective scheduling of 
its manpower. In most service systems, demand for 
services changes during the course of an operating 
day. In order to develop a shift schedule, demand 
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needs to be forecasted and converted (usually with 
the help of queuing models) into labor require- 
ments needed to maintain a desired service level 
during the course of an operating day. Employees 
are then assigned to various shifts specified by 
their start times, lengths, number and timing of 
relief and lunch breaks to meet these requirements. 
While understaffing will lower total labor cost, it 
will result in deterioration of the service quality 
and longer waiting times, and consequently, higher 
total cost. Overstaffing, on the other hand, will 
improve the service quality but will result in un- 
derutilization and excessive labor costs. Therefore, 
it is important for a service organization to 
schedule its manpower in an efficient manner to 
minimize labor costs while providing the desired 
service level. 

The shift scheduling problem involves deter- 
mining the number of employees to be assigned to 
various shifts and the timing of their breaks within 
the limits allowed by legal, union, and company 
requirements. To improve manpower utilization 
and lower labor costs, flexibility in shift types, start 
times, length, number and duration of breaks is 
provided. Flexibility in scheduling relief and lunch 
breaks for a given shift is provided with break 
windows specifying the time intervals within which 
employees assigned to that shift must start and 
complete their breaks. Since its introduction by 
Edie (1954), this problem has attracted consider- 
able attention in the literature. Both heuristic and 
integer programming based approaches have been 
proposed. The integer programming based ap- 
proaches have traditionally been based on the set 
covering formulation proposed by Dantzig (1954). 
For nearly four decades, this was the only mod- 
eling approach available for the general problem. 
Although the set covering formulation has been 
studied extensively, it is not a practical approach 
to solve this problem optimally. The set covering 
approach treats every possible shift type, shift start 
time, and break placement combination as a sep- 
arate shift and requires a separate integer variable 
for it. As a result, the size of the integer model 
increases very rapidly with the number of shift 
types, shift start times, breaks, and break window 
sizes, making it impractical to analyze with the 
available integer programming tools. 



Recently, two new modeling approaches have 
been developed by Bechtold and Jacobs (1990), 
and Aykin (1996). Both studies model break 
placements implicitly to reduce the size of the in- 
teger model. The objective of this study is to 
comparatively evaluate these two modeling ap- 
proaches to the general shift scheduling problem in 
a variety of problem situations. For this purpose, 
we first relax a number of assumptions made in 
Bechtold and Jacobs (1990) and present a gener- 
alized version of their formulation that was used in 
the computational experiments reported. The 
modeling approaches are compared using a set of 
220 test problems involving as many as 32 928 shift 
variations. The problem set considered in this 
study include, to the best of our knowledge, the 
largest shift scheduling problems solved optimally 
in the literature. Our results show that the for- 
mulation given in Aykin (1996) requires more 
variables than the formulation of Bechtold and 
Jacobs but substantially less than the set covering 
model. The number of constraints, number of 
nonzeros and the density of the ^-matrix with the 
approach given in Aykin (1996) are, however, 
substantially smaller than those for the formula- 
tion given in Bechtold and Jacobs, and the set 
covering model. More importantly, our results 
show that the time needed to locate an optimal 
shift schedule and the percentage of the problems 
solved optimally with Aykin's formulation are 
substantially better than those with the formula- 
tion of Bechtold and Jacobs; while no optimal 
solution could be found in 32 (14.5%) problems 
with the formulation of Bechtold and Jacobs, only 
2 problems remained unsolved with the formula- 
tion of Aykin. The average IP solution time for the 
problems solved by both modeling approaches was 
331.36 seconds with the formulation of Bechtold 
and Jacobs, and 113.04 seconds (about 66% less) 
with the formulation of Aykin. The fact that these 
results were obtained with a problem set including 
the largest shift scheduling problems solved opti- 
mally in the literature using an off-the-shelf integer 
programming software (LINDO) is very encour- 
aging in terms of the practical applications of im- 
plicit formulations in service systems. 

The remainder of the paper is organized as 
follows. In Section 2, we introduce the set covering 



T. Aykin I European Journal of Operational Research 125 (2000) 381-397 



383 



formulation, extend the formulation given in 
Bechtold and Jacobs into the scheduling environ- 
ments involving 24-hour continuous operations, 
multiple relief/lunch breaks, and multiple dis- 
joint break windows, and present the formulation 
given in Aykin (1996). Computational results ob- 
tained with 220 test problems are presented in 
Section 3. Finally, in Section 4, we summarize our 
results. 



2. Modeling approaches to the shift scheduling 
problem 

This section introduces the notation used and 
presents the three modelings available for the 
general shift scheduling problem. 



2.7. Set covering model of Dantzig ( 1954) 

The following set covering formulation was 
introduced by Dantzig (1954): 

Minimize ^c k X k (1) 
subject to 

Y^<*ktX k > b t for all t e 7\ 

k€K 

X k ^ 0 and integer, k e AT, (2) 

where X k is an integer variable defined as the 
number of employees assigned to shift k, T is the 
set of planning periods covered by the shift 
schedule, K is the set of all shifts, b, is the number 
of employees needed in period t to achieve the 
desired service level, c k is the cost of assigning an 
employee to shift k, and a kt is equal to one if 
period / is a work period for shift k and zero 
otherwise. 

With this formulation, all shift type, shift 
start time, and break placement combinations are 
enumerated and included, as separate shifts, in 
set K. 



22. Implicit modeling approach of Bechtold and 
Jacobs (1990) 

Recognizing the difficulties caused by the enu- 
meration of all shift variations in the set covering 
model and making use of the fact that not all in- 
formation concerning the break placements in- 
cluded in that model is necessary to develop an 
optimal shift schedule, Bechtold and Jacobs (1990) 
developed an implicit formulation of the problem 
that is substantially smaller than the equivalent set 
covering model. Although the implicit modeling 
idea itself is not new in the manpower scheduling 
literature (see, for example, Moondra, 1976; 
Gaballa and Pearce, 1979), their formulation uses 
a novel approach for modeling break placements 
implicitly. Bechtold and Jacobs (1990) compared 
their formulation with the set covering model ex- 
perimentally and reported superior results with 
their approach. 

Unfortunately, due to its assumptions, this 
formulation is limited to certain instances of the 
problem and cannot be applied to the general case 
in its present form. The following assumptions are 
listed in Bechtold and Jacobs (1990, p. 1341): (1) 
the system operates less than 24 hours daily, (2) 
planning periods are equal in length, (3) each shift 
is given a single break, (4) the break duration is 
identical for all shifts, (5) the break duration is one 
or more periods, (6) each shift has a single break 
window associated with it's workspan, (7) breaks 
should start and end during the shifts, (8) no ex- 
traordinary break window overlap exists (this 
condition exists when the break window of one 
shift is a strict subset of the break window of an- 
other, and has different starting and ending peri- 
ods than the larger window), and (9) no 
understaffing is allowed. Besides these, Bechtold 
and Jacobs (1990) also assume that every break 
window consists of contiguous planning periods 
(i.e. no split windows). They noted in their study 
that assumptions (2), (5), and (7) are consistent 
with the scheduling practice and are common in 
the literature, and (3) and (9) can be relaxed by 
modifying their formulation. With assumptions (4) 
and (6), however, their formulation would not be 
appropriate for a shift scheduling problem in- 
volving multiple relief and lunch breaks with 
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unequal duration (e.g. two 15-minute rest breaks 
and one 30-minute lunch break arrangement 
found in most service systems) and multiple dis- 
joint break windows associated with them. Fur- 
thermore, with assumption (1), their approach is 
limited to the problems involving shifts starting 
and ending during the same 24-hour period, and 
with assumption (8) to the cases not involving any 
extraordinary break window overlaps. Recogniz- 
ing these limitations, after discussing the effects of 
extraordinary break window overlaps, they state in 
the summary and conclusions section of their 
study that 

"Numerous research opportunities exist for 
the extension of the implicit modeling ap- 
proach for use in other operating environ- 
ments. For example, shift scheduling for 
telephone operators is typically done across 
24 hour planning horizons. Moreover, shift 
scheduling environments exist in practice for 
which the organization desires to include opti- 
mal placement of rest breaks as well as meal 
breaks in shift assignments (Hendersen and 
Berry, 1976)" (Bechtold and Jacobs, 1990, 
p. 1350). 

The study cited by Bechtold and Jacobs (1990) 
in the above paragraph (Henderson and Berry, 
1976), considers a shift scheduling problem in- 
volving full-time shifts with two relief breaks and 
one lunch break with unequal durations and 
multiple disjoint break windows. Since, this type of 
shifts are found in most practical applications, we 
assumed the same scheduling environment to 
compare the two modeling approaches. Moreover, 
since the case involving 24-hour continuous oper- 
ation (cyclical environment) has received consid- 
erable attention in practice and in the literature 
(see, for example, Baker, 1976; Bartoldi, 1981; 
Bartoldi et aL 1980; Bedworth and Bailey, 1987; 
Vohra, 1988; Nanda and Browne, 1992, among 
others), we considered both cyclical and acyclical 
(less than 24-hour operation) scheduling environ- 
ments. Solving shift scheduling problems in the 
scheduling environments considered with the for- 
mulation of Bechtold and Jacobs (1990), however, 
requires the relaxation of assumptions (1), (3). (4). 



and (6). Below, we present the modifications 
needed to relax these assumptions. The modifica- 
tions needed to relax the assumptions concerning 
the extraordinary break window overlaps (as- 
sumption 8) and contiguous break windows, and 
extend this approach to the general tour schedul- 
ing problem (solving daily shift and days-off 
scheduling together) can be found in Aykin 
(1995a). Let T be the set of planning periods 
covered by the shift schedule, and K be the set of 
shifts including all feasible combinations of dif- 
ferent shift types and start times. Define X k as an 
integer variable representing the number of em- 
ployees assigned to shift k, k e K. To model the 
placement of breaks of a particular type (e.g. first 
relief break taken before the lunch break) in an 
environment involving 24-hour continuous oper- 
ations (acyclical case is discussed later in this sec- 
tion), one needs to distinguish shifts with a break 
window starting and ending within the same day 
(for convenience, assume that a work day starts at 
7:00 a.m. and ends 7:00 a.m. the following day) 
from those with a break window starting in one 
day but continuing into the following day. Con- 
sider the first relief break windows for the shifts in 
K. Let KU1 be the set of shifts with a first relief 
break window starting and ending within the same 
day, and KU2 be the set of shifts with a first relief 
break window starting in one day and continuing 
into the following day (note that K = 
KU1UKU2). In the extended formulation, first 
relief breaks for the shifts in these two sets are 
scheduled separately by defining separate sets of 
first relief break variables and introducing two sets 
of break placement constraints. Let U\ t be an in- 
teger variable representing the number of em- 
ployees assigned to shifts in KU1 and taking their 
first relief breaks in period /, and U2, be an in- 
teger variable representing the number of em- 
ployees assigned to a shift in KU2 and taking 
their first relief break in period t. In this case, first 
breaks for the shifts in KU1 can be scheduled 
within their break windows by using the forward 
pass and backward pass constraints given in 
Bechtold and Jacobs (1990). In the cyclical 
scheduling environment considered, a similar set 
of constraints is needed for the shifts in KU2 in 
order to schedule correct number of first relief 
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breaks within their break windows. The number 
of constraints added for the shifts in KU2 is 
usually small and is determined by the length of 
the break window. When an optimal solution to 
the extended formulation is obtained, break times 
can be assigned to shifts by applying the simple 
break scheduling procedure given in Bechtold and 
Jacobs (1990) to each set separately. Thus, the 
following two sets of constraints are needed for 
scheduling the first relief breaks in the extended 
formulation. 

£ t/i,- £ 

/€SUly A€TU1I, 

for ally € NU1 - {qui}, (3) 

£ Uh - £ > 0 
forallyeMUl-{pul}, (4) 

E^-=E^ (5) 

A'tKUl /€TUI 

£t/2,- £ X k >Q 

(6SU2, A-eTU!2, 

for all j G NU2 - {qu2}, (6) 
Y U2, - Y x « > 0 

(6ZU2, ISTU22, 

forallyeMU2-{pu2} ) (7) 

keKUl ieTV2 

where for the first relief breaks for the shifts in 
KU1, we define 

pul earliest first relief break start time for the 

shifts in KU1 
qui latest first relief break start time for the 

shifts in KU1 
TU1 {pul, ■ • . ,qul}, the set of all possible first 

relief break start times for the shifts in 

KU1 



3S5 

SU1 7 {pul,. . . ,y}, the set of all first relief break 

start times between periods pul and j 
ZU1, {/'...., qui }, the set of all first relief break 

start times between periods j and qui 
NU 1 set of latest first relief break start times (in 

ascending order) for the shifts in KU1 
MU1 set of earliest first relief break start times 

(in ascending order) for the shifts in KU1, 
TU1 1, the set of shifts in KU1 with a first relief 

break window lying completely between 

periods pul and j, and 
TU21, the set of shifts in KU1 with a first relief 

break window lying completely between 

periods j and qui 



In a similar fashion, for the shifts in KU2, we 
define the parameters and the sets pu2, qu2, TU2, 
SU2 /} ZU2 y , NU2, MU2, TU12,-, and TU22 y . 

To model lunch break placements correctly, 
define, similar to KU1 and KU2 defined for the 
first relief breaks above, the sets KW1 and KW2 
(A:-KW1UKW2). Also define the integer 
variables W\ t and Wl t as the number of em- 
ployees assigned, respectively, to shifts in sets 
KW1 and KW2 and starting their lunch breaks 
in period t. The following two sets of constraints 
are needed to schedule lunch breaks within the 
correct break windows for the shifts in KW1 and 
KW2. 

/<=SWly A-eTWii, 

for all j GNW1 ~{qwl}, (9) 

/eZWl; A-=TW2 1 j 

for all je MW1 - {pwl}, (10) 

(€SW2j t-iTWI2 / 

for all j e NW2 - {qw2}. (12) 



386 



T. Aykiti f European Journal of Operational Research 125 (2000) 381-397 



iesv\j AeTVil, 
(13) for ally €NV1 -{qvl}, (15) 



Y, W2 >- E > o 

i€ZW2j *<=TW22, 

for all ye MW2- {pw2}, 

£^=X> 2 " (14) 

where for the lunch breaks for the shifts in KW1, 
we define 

pwl earliest lunch break start time for the 

shifts in KW1 
qwl latest lunch break start time for the shifts 

in KW1 

TW1 {pwl, . . . .qwl}, the set of all possible 
lunch break start times for the shifts in 
KW1 

SVVly fpwl, . . . J), the set of all lunch break 

start times between periods pwl and j 
ZWl y {/,..., qwl}, the set of all lunch break 

start times between periods j and qwl 
NW1 set of latest lunch break start times (in 

ascending order) for the shifts in KW1 
MW1 the set of earliest lunch break start times 

(in ascending order) for the shifts in 

KW1 

TWlly set of shifts in KW1 with a lunch break 
window lying completely between peri- 
ods pwl and y, and 

TW21, the set of shifts in KW1 with a lunch 

break window lying completely between 
periods j and qwl 



forallyeMVl-{pvl}, (16) 



2>*=£ F1 " (17) 
teKVl /eTVi 



E V2 >- E x *>° 

ieSV2j ASTV12, 

forally€NV2-{qv2}, (18) 



(SZV2, *eTV22, 

forally<=MV2-{pv2}, (19) 



X>=£>2„ ( 20 ) 

/GTV2 



where for the second relief breaks for the shifts in 
KV1, we define 



Similarly, define pw2, qw2, TW2, SW2„ ZW2 y , 
NW2, MW2, TW12,, and TW22 y for the lunch 
breaks for the shifts in KW2. 

Above, constraints (9)-(l 1) and (\2)-(\4) ensure 
that two sets of lunch break times are scheduled for 
the shifts in KW1 and KW2 separately. 

To schedule the second relief breaks, define the 
sets KV1 and KV2 (K == KV1 U KV2). Also define 
the integer variables V\ t and V2 t as the number of 
employees assigned, respectively, to shifts in KV1 
and KV2 and taking their second relief breaks in 
period t. The following two sets of constraints are 
needed to schedule second relief breaks within the 
correct break windows for the shifts in KV1 and 
KV2. 



pvl earliest second relief break start time for 

the shifts in KV1 
qvl latest second relief break start time for the 

shifts in KV1 
TV1 {pvl, . . . .qvl}, the set of all possible 

second relief break start times for the 

shifts in KV1 
SVI 7 [pvl, . . . J), the set of all second relief 

break start times between periods pvl 

and j 

ZVly {/, .... qvl}, the set of all second relief 
break start times between periods j and 
qvl 

NV1 set of latest second relief break start times 
(in ascending order) for the shifts in KV1 
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MV1 set of earliest second relief break start 

times (in ascending order) for the shifts in 
KV1 

TV1 lj set of shifts in KV1 with a second relief 

break window lying completely between 

periods pvl and j, and 
TV21, set of shifts in KV1 with a second relief 

break window lying completely between 

periods j and qvl 

Similarly, define pv2, qv2, TV2, SV2 7 , ZV2„ NV2, 
MV2, TV12/, and TV22, for the second relief 
breaks for the shifts in KV2. 

Constraints (15)-(17) and (18)-(20) above en- 
sure that two sets of second relief are scheduled 
within the break windows specified for the shifts in 
KV1 and KV2 separately. 

We now present the extended formulation of 
Bechtold and Jacobs for a cyclical case involving 
multiple rest and lunch breaks, and disjoint break 
windows. Let c k be the cost of assigning an em- 
ployee to shift k, k € K, a ki be equal to one if 
period t is within the workspan (a work or a 
break period) of shift k, and zero otherwise, and 
h t be the number of employees needed in period t. 
Then, the extended formulation can be stated as 
follows: 

Minimize ^c k X k (2 1 ) 

kefc 

subject to 

k€K y'eTUl yeTWl 

y'eTVl /cTU2 
/6TW2 y€TV2 

for all t e r, 

constraints (3)-(20), (22) 
X k , U\ t , U2 t , Wl gi W2 n V\ t , V2 t > 0 and integer. 



where 

vljt 1 if period t is a first relief break period for 
the employees starting their breaks in peri- 
od h J ^ TU1, and zero otherwise. 

oil,, 1 if period / is a lunch break period for the 
employees starting their breaks in period j, 
j € TW1, and zero otherwise, 

vlj t 1 if period t is a second relief break period 
for the employees starting their breaks in 
period j, j € TV1, and zero otherwise, 

and D2 Jn co2j ti and v2„ are defined similarly for the 
shifts in TU2, TW2, and TV2, respectively. 

In the above formulation, constraint (22) en- 
sures that the number of employees assigned to 
various shifts and available in period / minus the 
number of employees taking their first relief, 
lunch, and second relief breaks is sufficient to meet 
demand and provide the desired level of service in 
that period. 

The equality constraints (5), (8), (11), (14), (17), 
and (20) may be substituted in the associated 
break constraints in a manner similar to Bechtold 
and Jacobs (1990) to reduce the number of nonz- 
eros whenever appropriate. Our preliminary tests 
with this reduced form, however, did not show any 
computational advantage over the basic model. 

Note that if the scheduling problem is an 
acyclical one involving multiple relief and lunch 
breaks and break windows, then KU1 = K, 
KW1 =K, and KV1 =K, with KU2 ~ KW2 = KV2 
= 0. Hence, variables U2 ti 1Y2 I9 and K2„ and 
constraints (6)-(8), (12)-(14), and (18)-(20) are not 
needed in the above formulation. 

Recently, Thompson (1995) applied the for- 
mulation of Bechtold and Jacobs together with the 
implicit representation of shifts described in 
Moondra (1976) in a shift scheduling environment 
involving a high degree of shift length and start 
time flexibility. Given a set of shift start times and 
a set of shift lengths associated with every shift 
start time (e.g. shifts with lengths 6, 6.5, . . ., 9.5, 
and 10 hours are assumed to be available to start 
every 30 minutes throughout the day). Moondra's 
approach was used to model shifts with varying 
lengths and start times, and the modeling ap- 
proach of Bechtold and Jacobs to model break 
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placements implicitly to reduce the size of the 
formulation. The extension presented in Thomp- 
son (1995), however, has a number of limitations. 
First, it assumes that a high degree of flexibility in 
terms of both the shift start times and shift lengths 
is available. In practice, usually the company, 
union, and legal requirements are followed in de- 
termining shift start times together with shift 
lengths (see, for example, Segal, 1974; Henderson 
and Berry, 1976; Buffa, 1980; Bedworth and Bai- 
ley, 1987; Love and Hoey, 1990; Nanda and 
Browne, 1992; Spencer et al., 1992). Hence, except 
for such systems as fast food restaurants, the de- 
gree of combined shift length and shift start time 
flexibility assumed is not common in practice. If a 
high degree of shift length flexibility is not avail- 
able with most of the shift start times, on the other 
hand, the formulation presented in Thompson 
(1995) would offer little advantage over the for- 
mulation of Bechtold and Jacobs. For instance, in 
a problem involving only full-time employees (e.g. 
Henderson and Berry, 1976), or in a problem with 
alternative shift lengths that are available only at 
certain shift start times (e.g. Segal, 1974), the for- 
mulation given in Thompson would offer little (if 
any) advantage over the formulation of Bechtold 
and Jacobs. Another limitation of the formulation 
given in Thompson (1995) is that the break win- 
dows for a shift type are defined with reference to 
the shift start and finish times by specifying the 
minimum and maximum pre- and post-break work 
periods. As a result, even when two shifts share 
similar attributes (e.g. same break duration, even 
the same start time), the length of the break win- 
dow decreases with the shift length. In the illus- 
trative example given in Thompson (1995, Section 
2.2.), for example, while the break window is 
3 hours long for a 7-hour shift, the length of the 
break window decreases to 2 hours for a 6-hour 
shift, and to 1 hour (that is, a "fixed" meal break 
time for a 1-hour meal break) for a 5-hour shift 
even when they all start at the same time. In 
practice, start time and length of a shift determine 
the number and type of breaks (e.g. rest, lunch, 
idle period for a split-shift) employees receive. 
Together with them, break windows are specified 
with their lengths and position with respect to 
some other shift features (e.g. start time, "ideal" 



break time, etc). As reported extensively in the 
literature (see, for example, Segal, 1974; Hender- 
son and Berry, 1976; Bedworth and Bailey, 1987; 
Bechtold and Jacobs, 1990; Nanda and Browne, 
1992; Jarrah et ah, 1994), the length and posi- 
tioning of a break window usually do not change 
with the shift start time or length. This way the 
degree of break scheduling flexibility provided by 
break windows remains the same for all shift types 
to better match demand, improve manpower uti- 
lization, and provide timely relief to employees. 
This type of scheduling environment cannot be 
modeled efficiently with the formulation described 
in Thompson (1995). Also, the minimum shift 
length for a particular shift type, and the minimum 
pre- and post-break work periods should be ex- 
amined carefully to avoid infeasibility. 

Another limitation of the formulation presented 
by Thompson is its inability to model a realistic 
cost structure accurately. The formulation pre- 
sented assumes that the cost of assigning an em- 
ployee to a shift is a linear function of only its 
duration and not whether it is a full-time or a part- 
time shift, its start time or the time periods it covers 
(e.g. day shift vs. night shift). As also stated in 
Thompson (1995), to model a realistic cost struc- 
ture accurately, a formulation with explicit shift 
representation is needed. And, finally, like the for- 
mulation of Bechtold and Jacobs, the formulation 
given in Thompson is limited to the cases covering 
less than 24 hours (i.e. acyclical problem). In 
summary, the formulation presented in Thompson 
(1995) is applicable to special cases satisfying these 
conditions and cannot be applied to the general 
shift scheduling problem in its present form. 
Therefore, this extension of Bechtold and Jacobs 
(1990) is not considered in this study. 



2.3. Implicit formulation of Aykin (1996) 

In a recent study (Aykin, 1996), we presented 
another implicit formulation for the general shift 
scheduling problem. Like Bechtold and Jacobs 
(1990), Aykin (1996) is modeling the break place- 
ments implicitly. The approaches used in these two 
studies, however, are different in the way the 
breaks times are matched with the shifts. While the 
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formulation given by Bechtold and Jacobs (1990) 
uses break variables associated with each planning 
period and matches breaks with the shifts implic- 
itly, the formulation given in Ay kin (1996) defines 
separate break variables for each shift. This ap- 
proach extends the implicit break representation 
idea introduced in Gaballa and Pearce (1979) to 
the general case involving multiple rest and lunch 
breaks and multiple disjoint break windows. In the 
special case considered by Gaballa and Pearce, this 
approach required more variables and constraints 
than the equivalent set covering formulation. 
Therefore, their approach has not been generalized 
or considered any further in the literature. Aykin 
(1996) showed that although apparently not ben- 
eficial in the special case considered by Gaballa 
and Pearce (1979), this approach is surprisingly 
very successful in reducing the number of variables 
needed in the general shift scheduling problem. 
The number of nonzeros and the density of the A- 
matrix are also much lower than the set covering 
formulation. Besides these, the new formulation 
makes the same assumptions with the set covering 
formulation and is applicable to all cases that the 
set covering approach is applicable. 

To formulate the problem using this approach, 
we define K as the set of all shifts including all 
feasible combinations of shift start times and 
lengths, and X k as the number of employees as- 
signed to shift k. Let U kl , W kt , and V kl be the break 
variables representing the number of employees 
assigned to shift k and taking their first relief, 
lunch, and second relief breaks in period t, re- 
spectively. Let B\ k , BL k , and B2 k be the sets of 
planning periods forming the first relief, lunch, and 
second relief break windows of shift k, respec- 
tively. In this formulation, the first relief break 
variable U kt is defined only for t e B\ k , lunch 
break variable W k( for / e BL ki and the second 
relief break variable V kt for t e B2 k . Further, define 
ri„ 7L„ and 72, as the sets of shifts for which 
period / is a break start time within the break 
windows for the first relief, lunch, and second re- 
lief break, respectively. Also, let a kl be one if period 
t is in the shift span (a work or a break period 
during the shift) of shift k and zero otherwise. The 
general shift scheduling problem can then be for- 
mulated as follows: 



Minimize ^c k X k 



(23) 



subject to 

£ t/*- 51 w *<-" 

keK ken, kzTL{t-\) 

-XX-E^'^' fcrallrer, (24) 

A-£7L r ken, 

X t - 53% = 0 foralUetf, (25) 
X, -53^< = 0 for-dllkeK, (26) 
X t - 53 V ta = 0 for all k € K, (27) 

t<=B2i 

X n U kt , W kn V kt ^ 0 and integer. 

In the above formulation, break windows are 
specified by B\ ki BL k , and B2 k . In the extended 
formulation of Bechtold and Jacobs presented 
before, break windows determine the earliest and 
latest break start times (i.e. pul, qui, etc.) and 
the break placement requirements in the break 
constraints. Except for the no-extraordinary 
break window overlaps and contiguous break 
window requirements of Bechtold and Jacobs 
(1990) (both are relaxed in Aykin, 1995a,b), the 
extended formulation of Bechtold and Jacobs 
given above and the implicit modeling approach 
given by Aykin are equivalent to the set covering 
formulation. 

Also note that the number of variables and the 
number of constraints in the above formulation 
can be reduced by substituting one of the break 
constraints (25)-(27) associated with each shift in 
the other break constraints and the demand con- 
straints (24). Our tests with this reduced form, 
however, did not show any computational ad- 
vantage over the basic model in terms of model 
reliability and solution time. 
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3. Experimental evaluation 

In this section, we describe the shift scheduling 
environment considered in the computational ex- 
periments, discuss the characteristics of the im- 
plicit formulations, and report our results with 
220 problems. 

3.1. Scheduling environment 

We varied the scheduling environment consid- 
ered in the test problems along four dimensions: 
relief and lunch break window sizes, shift start 
time pattern, cyclical-acyclical operations, and 
demand pattern. Employee demand is assumed to 
be determined for 96 quarter-hour (= 4 x 24 hours) 
planning periods, T = {1, .... 96}. Concerning the 
shift start times, we considered two schemes; (i) 
9-hour shifts starting on the hour or half-hour, 
and (ii) 9-hour shifts starting every 15 minute. 
Each employee is assumed to be given one 
30-minute lunch break and two 15-minute relief 
breaks (one before and one after the lunch break). 
We specify the break windows with reference to 
"ideal" break times as follows: the ideal first relief 
break time for a shift is specified as 2 hours after 
the start of the shift, the ideal lunch start time is 
4 hours and 15 minutes (4 hours of work plus the 
first relief break) after the start of the shift, and 
the ideal second relief break time is 6 hours 
and 45 minutes"(6 hours of work plus the first relief 
and lunch breaks) after the start of the shift. The 
break windows are specified by setting the earliest 
break start times half an hour earlier than the ideal 
break start times. To test the modeling approaches 
in situations with varying difficulty level, we 
considered 1 1 break window length combinations 
allowing between 4 and 7 break start times for the 
relief breaks (1 hour to 1 hour and 45 minutes 
windows) and between 3 and 7 lunch break start 
times (1-2 hour windows). Together with these, we 
considered five demand patterns (Fig. 1). The 
following three demand patterns were reported in 
the literature and were either obtained from or 
resembling real patterns from service systems: 
(i) the trimodal demand profile given in Segal 
(1974) for telephone operators with a minimum 



quarter hour demand of 2 and a maximum of 89, 
(ii) the bimodal demand profile with a minimum 
quarter hour demand of 2 and a maximum of 9 
reported by Henderson and Berry (1976) for the 
General Telephone and Electronics Company of 
California, and (iii) the unimodal demand profile 
with a minimum quarter hour demand of 2 and a 
maximum of 27 given in Buffa et al. (1976) for the 
General Telephone Company of California. The 
two other demand patterns were synthetically 
generated; a bimodal and a trimodal sinusoidal 
demand pattern with a minimum quarter hour 
demand of 5 and a maximum of 50. We generated 
a total of 220 problems by considering both cy- 
clical and acyclical versions of the combinations of 
these factors and their levels. 



3.2. Model size characteristics 

The number of variables, number of con- 
straints, number of nonzeros and the density of the 
,4-matrix for the implicit formulations of Bechtold 
and Jacobs (1990), Aykin (1996), and the set cov- 
ering formulation are shown for some represen- 
tative cases (from the smallest to the largest 
considered in this study) in Tables 1 and 2, re- 
spectively. In these tables, n K is the number of shift 
start times in K y and n\ ki n2 ki and nL k show the 
number of break start times in the first relief, 
second relief, and lunch break windows, respec- 
tively. As can be seen from Table 1, in the acyclical 
problems considered, the number of variables (that 
is, the number of shift variations) in the set cov- 
ering model varies between 1984 and 20923. In 
contrast, the extended implicit modeling ap- 
proach of Bechtold and Jacobs requires between 
223 and 262, and the implicit modeling approach 
of Aykin requires between 403 and 1342 vari- 
ables. In every case, the extended formulation of 
Bechtold and Jacobs requires the smallest num- 
ber of variables. Aykin's approach requires more 
than the extended formulation of Bechtold and 
Jacobs but significantly less than the set covering 
formulation. The formulation of Aykin, on the 
other hand, has a substantially smaller number 
of nonzeros in the A -matrix than the other two 
approaches: while the set covering formulation 
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Fig. 1. Demand patterns: (A) Segal (1974).. (B) Henderson and Berry (1976), (C) Buna et al. (1976), (D) bimodal demand pattern, and 
(C) trimodal demand pattern. 



has between 63 488 and 1053 696 nonzeros and 
the formulation of Bechtold and Jacobs has be- 
tween 10 387 and 56124 nonzeros in the vl-ma- 
trix, the formulation of Aykin (1995a ? b) has 
between 2077 and 8448 nonzeros (i.e. about 80% 
less than the equivalent formulation of Bechtold 
and Jacobs, and 96.7-99.2% less than the 
equivalent set covering formulation). Further- 
more, while the density of the A -matrix of Ay kin's 
formulation varies between 1.5% and 2.9%, it is 
between 16.8% and 22.4%, for the ,4-matrix of 
the extended formulation of Bechtold and 
Jacobs, and 33.3% for the set covering formula- 
tion. Note that since, in the set covering model, 
all shift variations are enumerated, only demand 



constraints are needed. As can be seen from 
Table L the implicit modeling approaches 
require more constraints in order to assure 
correct break placements. Similar conclusions can 
be reached from the results presented in Table 2 
for the cyclical problems. 

As discussed before, the implicit formulations 
of this problem are taking advantage of the fact 
that only some of the information about the break 
times generated by the set covering formulation is 
needed in the model to obtain an optimal schedule. 
This is true since, in general, the shift scheduling 
problem has a large number of alternative optimal 
solutions. In most cases, given an optimal solution, 
one can generate alternative optimal schedules by 
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Table 3 






oummary oi computational expeninenis 








Bechtold and Jacobs 


Aykin 


Total number of problems considered 


220 


220 


Number of problems remained unsolved 






After 1st pass 


45 


15 


After 2nd pass 


32 


2 


Number of problems remained unsolved with both approaches 






After 1st pass 


11 


11 


After 2nd pass 


1 


1 


Number of problems with same CPU time 3 


4 


4 


Number of problems with a better CPU time 


12 (5.5%) 


203 (92.3%) 


Average 6 CPU time (s) per problem 


331.36 (188) 


113.04 (218) 


Lower bound on CPU time (s) per problem 


504.85 


200.53 



a The difference between the solution times ^0.5 s. 
b After two passes. 



switching the breaks of different shifts that have 
overlapping break windows, provided this does 
not violate any of the demand or break con- 
straints. Consequently, the set covering, and the 
implicit modeling approaches are three possible 
strategies to solve this problem; as the set cover- 
ing formulation requires, one may enumerate all 
rest and work patterns and determine only one of 
these optimal solutions that provides a complete 
schedule or, as in the extended formulation of 
Bechtold and Jacobs (1990), an optimal solution 
specifies only the active shifts and sufficient 
number of breaks,and a complete schedule is 
obtained from this solution by postprocessing. As 
a third strategy, the implicit formulation of Aykin 
(1996) is providing an approach that is taking 
advantage of the fact that one can formulate the 
problem to obtain slightly more information in 
the optimal solution than Bechtold and Jacobs 
(1990) (but still less than the set covering for- 
mulation) by scheduling shifts and breaks asso- 
ciated with each shift separately. Thus, the 
formulation of Bechtold and Jacobs (1990) is 
more compact in terms of the number of vari- 
ables. The primary advantage of Aykin (1996), on 
the other hand, is that it requires significantly 
smaller number of constraints and nonzeros in 
the A-matrix than Bechtold and Jacobs (1990). 
As discussed in the computational results section, 
these approaches are significantly different in 
the amount of computational effort needed and 
reliability. 



3.3. Experimental results 

All 220 problems were formulated using the 
implicit modeling approaches of Bechtold and 
Jacobs (1990) and Aykin (1996), and were solved 
using LINDO (Version 5.3.) together with a 
FORTRAN interface that generates and inputs 
problem data. All runs were completed in batch 
mode on a Sun Sparc 512 computer. Since the 
superiority of both implicit modeling approaches 
to the set covering formulation has been demon- 
strated separately with computational experiments 
in Bechtold and Jacobs (1990), and Aykin (1998), 
the set covering formulation was not considered in 
our computation experiments. We also note that 
the memory requirements (e.g. number of vari- 
ables) for the set covering formulation of most of 
the problems considered in our study would exceed 
the limits of the software/hardware combination 
used. 

Both Bechtold and Jacobs (1991) and Aykin 
(1996) reported that the break variables in their 
formulations tend to be integer when the shift 
variables are integer. Therefore, a branching 
strategy giving higher priority to the shift variables 
X k was implemented in LINDO. The CPU time 
limit was set to 1200 seconds. No attempt was 
made to determine the branching requirements 
and total solution time if LINDO could not locate 
an integer optimal solution to a problem within 
the CPU time limit. As seen in Table 3, this was 
the case in 45 (20.5%) out of 220 problems with the 
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extended formulation of Bechtold and Jacobs, and 
in 15 (6.8%) out of 220 problems with the ap- 
proach of Aykin. These cases include the 11 
problems in which both approaches failed to lo- 
cate an optimal integer solution within the allowed 
time. In order to gain more information, we made 
a second attempt using a branching strategy giving 
a higher priority to the variables associated with 
the shifts covering higher total demand during its 
shift span. In case of a tie, the variables associated 
with the shifts with an earlier start time were 
chosen for branching. If a second attempt was 
made, the total time for both passes is reported. 
Thus, the CPU time limit for two passes is 2400 
seconds. Out of 45 problems which remained un- 
solved after the first pass with the approach of 
Bechtold and Jacobs, 13 more solved in the second 
pass. With the approach of Aykin, on the other 
hand, out of 15 not solved during the first pass, 
only 2 problems remained unsolved after the sec- 
ond pass. In summary, after two passes, 32 prob- 
lems with the extended formulation of Bechtold 
and Jacobs, or about 14.5% of the problems, and 2 
problems with the formulation of Aykin, or about 
0.9%, remained unsolved within the allotted time. 

As can be seen from Table 3, in 203 (92.3%) of 
the problems, the formulation given in Aykin 
(1996) required less CPU time to locate an optimal 
integer solution than the extended formulation of 
Bechtold and Jacobs (1990). In 12 problems, the 
extended formulation of Bechtold and Jacobs 
(1990) required less CPU time than that of Aykin 
(1996). And in 4 of the remaining 5 problems (out 
of 220), the difference between the solution times 
with the two formulations was less than 0.5 sec- 
onds. Hence, these cases were reported as requir- 
ing the same amount of CPU time. Only one case 
remained unsolved with both formulations. 

Out of 220 problems, 187 of them were solved 
optimally by both formulations. The average CPU 
time for these problems with the extended formu- 
lation of Bechtold and Jacobs is 331.36 seconds. In 
contrast, for the same 187 problems, the average 
CPU time with the formulation of Aykin is 1 13.04, 
about 1/3 of the average time needed with the 
formulation of Bechtold and Jacobs. Table 4 
provides more detailed information concerning the 
solution times. In this table, the minimum, aver- 
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age, and maximum number of shifts, and mini- 
mum, average, and maximum CPU times with 
both formulations for the problems in each group 
are also presented. As seen from these results, the 
average solution time increases with both formu- 
lations as the number of shifts increases. The rate 
of increase with the formulation of Bechtold and 
Jacobs is, however, significantly higher than with 
the formulation of Aykin. Also note that the av- 



erage solution time needed with the extended for- 
mulation of Bechtold and Jacobs in these problem 
groups is between 1.76 and 5.84 times higher than 
that with the formulation of Aykin. Table 4 also 
shows the number of problems remained unsolved 
(after two passes) in each group. For each group, 
below the average solution time, we report (in 
parenthesis) the average CPU time spent per 
problem in the group to include the unsolved 




(A) 



8 8 S 8 S S S 8 8 8 8 8 8 8 8 8 8 3 § 8 S S 8 
Solution Time (CPU seconds) 



120, 




Fig. 2. Distribution of the solution times: (A) with the formulation of Aykin (1996), (B) with the formulation of Bechtold and Jacobs 
(1990). 
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problems. These averages were obtained by in- 
cluding 2400 seconds spent for each case that is 
remaining unsolved after two passes together with 
the actual solution times for the problem that were 
solved optimally. These average times provide a 
lower bound on the average solution times per 
problem. As can be seen, the lower bounds for the 
formulation of Bechtold and Jacobs are also 
higher than the corresponding lower bounds for 
the formulation of Aykin. 

Finally, Fig. 2 shows the distribution of the 
individual solution times. In this figure, the histo- 
gram shows the percentage of the problems with a 
solution time in the given interval, and the curve 
shows the cumulative percentage of the problems 
versus the solution time. The dashed lines mark 
the points where the scale of the solution time axis 
is changed due to the spread of the recorded so- 
lution times. As can be seen from this figure, in 
more than 60% of the problems, an optimal integer 
solution was obtained within the first 50 seconds 
with the formulation of Aykin and, in more than 
75% of the problems, within 100 seconds. Also 
note that about 15% of the problems required 
between 100 and 500 seconds and only 10% re- 
quired more than 500 seconds. In contrast, with 
the extended formulation of Bechtold and Jacobs, 
only about 25% of the problems were solved 
within the first 50 seconds, and about 37% within 
the first 100 seconds. About 30% of the problems 
required between 100 and 500 seconds, and 33% 
required more than 500 seconds. This approach 
requires more than 2400 seconds in 14.5% of the 
problems that remained unsolved. 

As discussed in Section 3.2, in terms of the 
number of constraints, the extended formulation 
of Bechtold and Jacobs (1990) is a more compact 
formulation of the problem than Aykin (1996). 
However, the results presented in this section show 
that this is not providing any computational ad- 
vantage over Aykin (1996). A comparison of the 
optimal LP objective values revealed that both 
formulations are producing the same LP objective 
values. However, in most cases (as reported in 
Table 3, 92.3%), solving (i.e. pivoting) the initial 
LP relaxation as well as the subsequent branch 
subproblems consume significantly more time with 
Bechtold and Jacobs (1990) than Aykin (1996). 



One reason for this may be the substantially higher 
density of the ^-matrix of the Bechtold and Jacobs 
formulation. In summary, these results show that 
the formulation given in Aykin (1996) offers better 
modeling reliability, and requires less memory 
and, on average, one-third of the time needed with 
the formulation of Bechtold and Jacobs (1990). 

4. Summary 

In this paper, we experimentally compared the 
extended formulation of Bechtold and Jacobs 
(1990) and the implicit modeling approach of 
Aykin (1996) to the general shift scheduling 
problem through solving 220 problems. We con- 
sidered two criteria; model reliability and solution 
time. In terms of model reliability, while 32 
(14.5%o) out of 220 problems remained unsolved 
after two passes with the extended formulation of 
Bechtold and Jacobs, only 2 (0.9%) of the prob- 
lems remained unsolved with the formulation of 
Aykin. Further, the average solution time for the 
problems solved by both approaches was 331.36 
seconds with the extended formulation of Bechtold 
and Jacobs, and 113.04 seconds with the formu- 
lation of Aykin. In summary, our results show that 
the formulation given in Aykin (1996), although 
requiring more variables than the equivalent for- 
mulation of Bechtold and Jacobs, requires smaller 
number of constraints, has lower density and 
smaller number of nonzeros in the ,4-matrix, offers 
better model reliability, and requires on average 
one-third of the time needed with the extended 
formulation of Bechtold and Jacobs. The fact that 
these results were obtained with a problem set in- 
cluding the largest shift scheduling problems 
solved optimally in the literature using an off-the- 
shelf integer programming software is very en- 
couraging in terms of the practical applications of 
the implicit scheduling models in service systems. 
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A composite branch and cut algorithm for optimal 
shift scheduling with multiple breaks and break 
windows 

T Aykin 

Bell Laboratories, New Jersey, USA 

This paper presents a new branch and cut algorithm for optimal shift scheduling with multiple breaks and break 
windows. The proposed algorithm is based on an implicit formulation of the problem requiring significantly smaller 
number of variables than the set covering formulation of Dantzig. The new algorithm, adding cuts, developing upper 
bounds for the variables, and employing an efficient rounding heuristic, was tested successfully with 90 test problems 
involving between 2160 and 32 928 shift variations and five demand profiles. Our results show that large shift 
scheduling problems can be solved optimally with the proposed branch and cut algorithm and the implicit formulation. 

Keywords: shift scheduling; service operations management; integer programming 



Introduction 

The shift scheduling problem arises in such situations as 
scheduling telephone operators, agents at telemarketing 
centres, and attendants at highway toll booths, etc. The 
problem involves determining the number of employees to 
be assigned to various shifts (specified by shift start time, 
shift length, number and type of breaks, etc) and the timing 
of their relief and lunch breaks so as to meet demand at 
minimum cost. Demand for the services provided by these 
systems typically varies during the course of an operating 
day. The problem may involve 24 h continuous (cyclic) or 
less-than-24 h operations. Since its introduction by Edie, 2 
both heuristic and integer programming based approaches 
have been studied for this problem. The integer program- 
ming based approaches have traditionally been based on the 
following set covering formulation provided by Dantzig. 1 

min Y, c k x k 0) 

subject to 

£ a h X k > b n for alU € T (2) 

keK 

X k ^ 0 and integer, k € K, 

where K is the set of all shifts, T is the set of planning 
periods that the shift schedule covers, b t is the number of 
employees needed in period /, c k is the cost of assigning an 
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employee to shift k, a h is one if period r is a work period for 
shift k and zero otherwise, and X k is an integer decision 
variable representing the number of employees assigned to 
shift k. 

Consideration of break flexibility has been shown to be 
important in determining shift schedules and minimum 
staff size to meet demand. 3 ' 4 The case involving multiple 
relief and lunch breaks and break windows better matches 
scheduling practice. The set covering model, however, 
requires that all possible combinations of different shift 
types, lengths, shift start times as well as all possible relief 
and lunch break start time combinations be enumerated and 
included as different shifts in K. When break windows (time 
intervals during which employees should start and 
complete their breaks) and the resulting break placement 
possibilities are included, the number of shifts in K 
increases exponentially. Since the set covering formulation 
requires an integer variable for every shift in K, the size of 
the integer model makes obtaining an optimal shift sche- 
dule impractical in all but some small cases. Showalter and 
Mabert, 5 for example, recently stated that \ . .Indiana Bell 
currently schedules staff where thousands of different shift 
schedules are present, due to 15 min planning periods, rest 
breaks, split shifts, and lunches. Since integer optimising 
procedures (i.e. branch and bound) requires significant 
computational effort to arrive at a solution when thousands 
of decision variables are present, we expect well-desig- 
nated heuristics to be attractive for staff planners for some 
time to come' (p 55). Similarly, Nanda and Browne 6 
concluded that "...Thus, while it is theoretically possible 
to use I LP (integer linear programming) for scheduling 
problems with variable breaks, for many applications it is 
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impractical to formulate, no less solve, by ILP' (p 206). 
Similar comments can be found in Bedworth and Bailey, 7 
Glover et ai s Easton and Rossin 9 among others. This view 
has also been shared by a number of researchers who have 
recently developed and implemented industrial applications 
(see, for example, Holloran and Byrn, 10 p 13). We are also 
aware of two decision support systems recently developed 
for shift scheduling: NAMES™ 10 by AT&T for use at 
telemarketing centres to schedule agents (Spencer et al, n ) 
and ROMAN by the Information Technology Institute, 
Singapore (Khbong and Lau 12 ). These systems also use 
heuristics for scheduling shifts and breaks. 

One popular strategy considered in the literature involves 
obtaining shift assignments by first solving a much smaller 
problem without including some or all relief and lunch 
breaks and break windows. A feasible shift schedule is then 
constructed by placing the relief and lunch breaks and 
adjusting the employee-shift assignments heuristically. 
Segal 13 applied this approach to a telephone operator 
scheduling problem in which each shift may include up to 
two 15min relief breaks, one 30min lunch break, and a 
1.5 h break window for every relief lunch break. He 
proposed a network flow based heuristic for solving the 
shift scheduling and the break placement problems sepa- 
rately in three phases. Keith 14 considered a similar operator 
scheduling problem with lunch and relief breaks at Illinois 
Bell Co. Assuming predetermined relief and lunch break 
start times (that is, ignoring break windows), the heuristic 
approach he proposed first solves the LP relaxation of a 
smaller set covering model and applies a rounding heuristic 
to obtain a feasible shift schedule. In the second part of his 
algorithm, break windows and alternative break placements 
are examined heuristically in an attempt to improve the shift 
schedule found in the first part. A number of other strategies 
have also been considered in the literature; Henderson and 
Berry, 15 and Bartoldi 16 considered heuristic solution of the 
complete set covering model. A construction type heuristic 
using the so-called Luce index was proposed by Buffa 17 for 
operator shift scheduling at the General Telephone 
Company of California. 

Although the solution side of the problem has been 
studied extensively, the modelling side has not received 
similar attention. The integer programming based 
approaches have traditionally used the set covering model 
of Dantzig. 1 Modelling a special case of the problem with a 
single break has been considered by several researchers. 
Bailey and Fields 18 considered a shift scheduling problem 
involving shifts with a single break and a break window. 
They used the set covering approach and enumerated all 
break placement possibilities. This case was also considered 
by Bechtold and Jacobs. 19 They presented an integer 
programming formulation using a set of break placement 
variables. The formulation presented in Bechtold and 
Jacobs 19 was, however, limited to the cases involving less- 
than-24 h operations, one break of identical duration for all \ 



shifts, a single break window per shift, and no 'extraordin- 
ary break-window overlaps' (described as a situation in 
which the break window for a shift is included entirely in 
the break window of another shift and the larger break 
window starts and ends, respectively, at least one planning 
period earlier and later than the smaller break window). 
Recently, Thompson 20 combined the formulation of Bech- 
told and Jacobs 19 with that of Moondra 21 in a shift schedul- 
ing environment requiring a high degree of flexibility. 
Besides having the limitations of Bechtold and Jacobs, 19 
the formulation presented by Thompson 20 is also limited by 
the fact that the degree of shift length and start time 
flexibility needed to realize the benefits of this approach is 
rare in practice. Moreover, due to the modelling approach 
used, the length of the break window has to decrease with 
the length of the shift (for example if a 9 h shift has a 3 h 
long lunch break window, a 7 h shift of the same type has to 
have a 1 h lunch break window). Another important limita- 
tion of this formulation (also pointed out in Thompson 20 ) is 
its ability to model a realistic cost structure. In summary, 
past attempts to develop efficient modelling tools took 
advantage of the special structure of the cases considered 
and are not readily generalizable. 

In the typical shift scheduling environment found in 
service industry, an employee working 7-9 consecutive 
hours a day receives one lunch break and two rest breaks, 
one before and one after the lunch. Shifts with shorter work 
span may be given fewer breaks. The length of a lunch 
break is usually a half hour to an hour and a rest break 
15— 30min. Furthermore, when multiple breaks are given, 
usually separate break windows are specified for these 
breaks to provide scheduling flexibility and timely relief 
to employees. 6 ' 7 ' 13 ' 15 Also, in some systems, namely tele- 
phone operator centres, employees may be assigned to split 
shifts consisting of two work periods separated by an idle 
period lasting 2-4 h. A rest break is also provided in each 
work period. In this paper, we propose a new approach for 
solving this problem (referred to as the general shift sche- 
duling problem) optimally. Since, when formulated as a set 
covering model, the problem size presents the biggest 
obstacle to obtaining an optimal shift schedule, we first 
discuss an implicit formulation for the general shift schedul- 
ing problem (a discussion of the equivalence of these 
models together with an illustrative example is given in 
Aykin 3 ). Rather than enumerating all possible break place- 
ments and associated shift combinations, two sets of vari- 
ables, shift assignment variables and break variables, are 
used to assign employees to shifts and to schedule sufficient 
numbers of breaks for them. Compared to the set covering 
formulation of Dantzig, 1 this approach reduces the problem 
size greatly (see Tables 1 and 3). Furthermore, all shift 
scheduling problems that can be formulated with the set 
covering approach can be analyzed with this approach. The 
proposed branch-and-cut algorithm first solves the LP 
relaxation of the implicit formulation to obtain a lower 
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bound. An efficient rounding heuristic is then applied to the 
LP solution in an attempt to obtain a good upper bound and 
a feasible schedule. If this results in a solution with an 
objective value equal to the lower bound, the algorithm 
terminates with an optimal solution. Otherwise, it adds objec- 
tive value cuts and upper bounds on the variables, and itera- 
tively updates these cuts and bounds while using the rounding 
heuristic and a limited branch and bound search to find an 
optimal schedule. As reported in the computational results 
section, our experience with the branch and cut algorithm 
showed that the proposed approach is very efficient in solving 
large shift scheduling problems optimally. The scheduling 
problems we considered involve between 2160 and 32 928 
shift variations. Some of these problems are, to the best of our 
knowledge, the largest shift scheduling problems solved opti- 
mally in the literature. 



W kt = the number of employees assigned to shift k and 
starting their lunch break in period /. 

The break variables U ktX , V h2 and W m are defined for k e K 
and for t\ e B\ k , tl e B2 k and t3 e BL ki since breaks can 
only be scheduled within their windows. The notation is 
illustrated in Figure 1 . 

Different than the set covering approach, the shift set K 
in the implicit formulation does not include the shift 
variations due to different break placements. Suppose 
there are q different shift types, and each shift type can 
start at s h different times during a work day. Then, each shift 
type and shift start time for that type is identified as a 
different shift in K (namely K includes E/Li s h shifts). This 
problem with multiple breaks and break windows can now 
be formulated as follows. 



Problem formulation 

To present the implicit formulation, we assume the follow- 
ing scheduling environment: Each employee receives two 
relief breaks and one lunch break; a work day is 24 h or 
less, divided into equal planning periods (eg 15min inter- 
vals); each relief break covers one and each lunch break 
covers two planning periods. The formulation presented, 
however, is not restricted with these assumptions. Model- 
ling shifts with more (or less) breaks and/or longer break 
durations is discussed in Aykin. 3 The following sets and 
parameters are needed to specify each shift k e K: 

c k = the cost of assigning an employee to shift k 
a h = I if period t is in the shift span of shift k (that is, a 

work or a break period) and zero otherwise 
Zn^the sets of planning periods in which a worker 

assigned to shift k may start his/her first relief break 
52^-the sets of planning periods in which a worker 

assigned to shift k may start his/her second relief 

break 

BL k — the sets of planning periods in which a worker 
assigned to shift k may start his/her lunch break 

Also define the following sets: 

AT = the set of all shift type-shift start time combinations 
n r = the set of shifts for which period t is a first relief 

break start period 
77, - the set of shifts for which period / is a second relief 

break start period 
TL t = the set of shifts for which period / is a lunch break 

start period 

Finally, define the following decision variables: 

X k = the number of employees assigned to shift A' 

Ufr = the number of employees assigned to shift k and 

starting their first relief break in period t 
^ 7 = the number of employees assigned to shift k and 

starting their second relief break in period / 



Problem PI 

min ]T cA 0) 

k<=K 

subject to 

E a kt*k - E u kt~ E "V-d 

keK k€T), te77 ( ;-i) 

- E w kt~ E Vkt^K for all /eT, (4) 

keTL, keTl, 

*k- E Ck = 0, for all keK, (5) 

t<=B\ k 

Xk~ E W h = 0 9 for all k e K y (6) 

tzBL k 

X k - E ^ = 0, for all k e K, (7) 

teB2 k 

X k , W kn V kJ ^ 0 and integer for all k and t. 

In Problem PI, constraint (4) ensures that the number of 
employees assigned to various shifts and available in period 
t minus the number of employees taking their first relief, 
lunch and second relief breaks is sufficient to meet the 
demand b, in that period. On the left side of (4), the third 
and the fourth terms give the numbers of employees on their 
lunch break in period t including those who have started 
one period earlier in (/ - 1) and continuing in period t> and 
those just starting in period f, respectively. Break 
constraints (5—7) require that one first relief, one lunch, 
and one second relief break be scheduled for every 
employee assigned to shift k e K. The /1-matrix (the 
matrix of constraint coefficients) for the two-shift example 
shown in Figure 1 is given in Figure 2. 

Note that in the proposed formulation, although suffi- 
cient numbers of relief and lunch breaks are scheduled for 
all employees by the break constrains (5-7), their timing 
for an employee working a specific shift is not determined, 
in fact, provided constraints (5-7) are satisfied, assignment 
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T1i7={2> 
TL17= {1} 
T2 17 = ? 



1 Shift 1 — 5? 

1 st Relief Break Lunch Break 2nd Relief Break 

B1, = $.7,8,9.10,1 1} BL, != {15,16.17,18,19} B2, = {25,26^7^8,29^0} 

\ <: i m ^ i< n 

r t i ; i *i i i g ; i i i i i | m| i i i i i — i — i j — i — i — i — i — i — i — i — i i i 1 

2 4 6 8 10 12 14 16 ! 18 20 22 24 26 28 30 32 34 36 40 44 48 



j | K=.{1.2} 

< H Shift 2 > 

1st Relief Break Lunch Break 2nd Relief Break 

i r t • i i i ! i ! i i i i i i i i i n i i i { t i i i 1 

4 8 10 12 14 16, 18 20 22 24 26 28 30 32 34 36 38 40 42 44 48 



TIME: 7:00 8:00 9:00 10:00 11:00 12:00 13:00 14:00 15:00 16:00 17:00 18:00 19:00 



H : Ideal break time 

Figure 1 Shifts, break windows, break start times and the sets T\ k , TL kl and T2 k . 



of employees to specific break start time combinations does 
not affect the total number of employees needed to meet the 
demand during a work day. After obtaining an optimal 
solution to Problem PI, employees assigned to a shift can 
be assigned to specific break combinations in a variety of 
ways. One approach may involve assigning employees to 
the earliest first relief, lunch and second relief break times 
scheduled in the optimal solution for their shift. Since 
sufficient numbers of relief and lunch breaks are scheduled 
in the optimal solution for all shifts, this approach provides 
an efficient means for developing a complete schedule from 
an optimal solution of Problem PI. 

Another version of the implicit formulation with fewer 
variables than both Problem PI and the set covering 
formulation, and fewer constraints that Problem PI can 
be obtained by using one of break constraints (5-1) to 
replace X k with a set of break variables. This, however, 
increases the number of nonzeros and the density of the A- 
matrix. We didn't consider this version any further since 
our preliminary runs showed that the resulting formulation 
requires significantly more memory and time to solve than 
Problem PI. 



Branch and cut algorithm 

The branch and cut algorithm we propose first solves the 
LP relaxation of Problem PI. A lower bound is provided by 
this solution. Before introducing cuts and using a branch 
and bound procedure, an efficient rounding heuristic is 
applied to the LP solution to obtain a good upper bound 
and a feasible schedule. If the upper bound found by the 
heuristic equals the lower bound, the branch and cut 
algorithm terminates with an optimal solution. Otherwise, 
the schedule found by the heuristic is recorded and the 
objective value for this solution is stored as the upper 
bound. The branch and cut algorithm then adds cuts, 
updates the upper bounds on the variables, and solves its 
LP relaxation again. The rounding heuristic is then applied 
to this solution and the new schedule is recorded if it 
produces a tighter upper bound. The search is terminated if 
the new upper bound is equal to the lower bound. Other- 
wise, a limited branch and bound search is conducted to 
determine if a feasible solution with an objective value 
equal to the current lower bound exists. If one is found, the 
algorithm terminates with an optimal solution. If no such 
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Figure 2 The constraint coefficients for the 2 shift example. 



solution exists, the lower bound and cuts, are updated and 
these steps are repeated. In the following sections, we 
present the initial bounds used in the algorithm, the round- 
ing heuristic, and procedures for updating the lower and 
upper bounds. In this section, we assume that all objective 
function coefficients c k (shift assignment costs) in (3) are 
equal (for convenience, to one). The case involving unequal 
assignment costs is discussed in the Appendix. 



Initial bounds 

A lower bound on the number of employees needed is 
obtained from the demand constraints (4) as 



where SL max = maLX k^{^k "~ (number of break periods 
taken by employees assigned to shift k)} and SL k is the 
number of planning periods covered by shift k. 

One can also obtain upper bounds on the decision 
variables using again the demand constraints. Since, 
X k ^ U kv X k ^ W h and X k ^ V kt by definition, any upper 
bound on X k is also an upper bound on the associated break 
variables. First, notice that if the employees working shift 
k are not given any break and are to work SL k periods, 
say periods f = 1 , . . . , SL k without losing generality, 
then X k ^ 6 max where 6 max = max{6, , . . . , b SLk ). Thus, if 
X k = fe max , then employees working shift k will be sufficient 
to meet demand during periods t = 1 , . . . . SL k even without 
the employees assigned to other shifts. But, since the 
employees are given relief and/or lunch breaks, more 
employees may be needed especially during the break 
windows in order for every employee to take his/her 
breaks and to meet demand at the same time. Consider a 
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one-period relief break with a break window allowing n Xk 
possible break start times in B\ k . If this is the only break the 
employees assigned to shift k are given, then an upper 
bound on X k is given by 



UB1* = 



(A 



max - E (*max ~ b t)Y 



0»u - 0 



+ 1 



otherwise. 
(9) 



where [y] is the largest integer less than or equal to y. 

If the employees are given a lunch break lasting two 
periods, an upper bound on X k can be obtained as follows. 
Let the maximum number of lunch breaks that can be 
scheduled with b max employees be I max . If this is the only 
break that the employees take, then an upper bound is given 
by 



UBL* = 



*max + 



Ufc + D/2] - 1)J 



if ^max ^ ^max > 



+ 1 }, otherwise. 

(10) 



An upper bound UB* for a shift variable X k including two 
or more breaks can be obtained by taking the highest of 
these bounds. The branch and cut algorithm initially 
computes UB^ for all shift variables X k and include them 
as simple upper bounds. 



Rounding heuristic 

The rounding heuristic used in connection with the branch 
and cut algorithm provides an efficient means for obtaining 
a feasible schedule from the solution of the LP relaxation of 
Problem PI. As reported in the computational results 
section, solutions obtained with this heuristic provided 
tight upper bounds and, in many cases, turned out to be 
optimal. 

The heuristic consists of two phases: A construction 
phase and an improvement phase. To start the construction 
phase, first the LP relaxation of the problem in the current 
iteration of the branch and cut algorithm is solved. Let X k ? , 

u h ? > w h ? and v t p be the values ° f the shift and break 

variables in one such solution. The shift variables X k are 
rounded to the nearest integer; that is, if the fractional part of 
X k LP is less than 0.5, set X k = [X k L? ]. And if the fractional 
part of X k L? is greater than or equal to 0.5, then set 
X k — (X k LP ), where (a) is a function returning the smallest 
integer greater than or equal to a. The break variables are all 
rounded down, namely set U h = [t/£ p ], W h = [Wjf] and 



V h = [Vjfl Note that since the shift variables and the break 
variables are rounded differently and separately, not enough 
breaks may be scheduled for the employees assigned to shift 
k. The opposite (that is, availability of more breaks than the 
scheduled employees), however, is not possible. Therefore, 
before proceeding any further, the rounding heuristic sche- 
dules additional relief and lunch breaks whenever sufficient 
numbers of breaks are not scheduled for a shift. To 
schedule new breaks, the rounding heuristic determines 
the number of employees that are available (excluding 
the ones on break) minus the number needed in each 
planning period. Let S t be this difference for period t (if 
positive, referred to as surplus, if negative, as shortage). 
Then, an additional break of the type needed for a shift is 
scheduled in the period with the highest surplus (smallest 
shortage) in the break window of that break type. And if 
two or more additional breaks are needed for a shift, they 
are scheduled in this fashion, one at a time, while updating 
S k values. This procedure guarantees sufficient numbers of 
breaks for all scheduled employees while minimizing, one 
break at a time, the effects of scheduling additional breaks in 
terms of the number of employees needed. 

After scheduling sufficient numbers of breaks for all 
shifts, the minimum period surplus/shortage S min 
= min, €r {S,} is determined. If 5 min < 0, then the partial 
schedule available does not satisfy all period demand 
constraints (4). In this case a new employee is assigned to 
one of the shifts in an attempt to satisfy demand in all 
planning periods. To assign the new employee to a shift, first 
the minimum period surplus/shortage within the shift span 
of each shift, 5£" n = min ter {a h S t }, is determined. The new 
employee is then assigned to the shift with the largest period 
shortage, namely S m . = mm keK sf m . Ties are resolved using 
the total shortage covered by the shifts during their shift 
span. The number of employees assigned to shift m* is 
increased by one, X m . = X m . + 1, and the rest and lunch 
breaks for the new employee are scheduled in the same way 
the additional breaks are scheduled after the initial rounding. 
We have considered a number of schemes for choosing 
shifts to assign new employees such as the total shortage 
covered during the shift span. Our preliminary tests, 
however, showed no improvement over the maximum 
period shortage approach discussed here. 

After adding a new employee, the rounding heuristic 
checks for the redundant employees. This is done by 
considering dropping an employee from each shift (one 
shift at a time) while determining this employee's breaks 
(also to be dropped) as the ones that are scheduled in the 
periods with the smallest surplus (or highest shortage). 
When it is found that an employee can be removed from 
the current partial schedule without causing any additional 
shortage during his/her shift, the value of the associated 
shift variable X k is decreased by one. The shift schedule is 
checked for redundant employees every time a new 
employee is added or when a feasible schedule satisfying 



T Aykin — A composite branch and cut algorithm 609 



all period demand is found. When the latter condition 
occurs, the construction phase is completed. 

In the improvement phase, the rounding heuristic 
attempts to lower the number of employees in the feasible 
schedule found in the construction phase by replacing two 
employees that are assigned to two different shifts by one 
new employee. For this purpose, two shifts with nonzero 
assignments in the current schedule are selected and one 
employee from each of these shifts is dropped together with 
the scheduled breaks for these employees. The new 
employee is added by considering all possible shifts. And 
the breaks for the new employee are scheduled to the 
periods in the respective break windows with the highest 
surplus (or smallest shortage if no period with surplus 
exists within the break window). If, after assigning the 
new employee to a shift, no period with shortage exists, the 
new schedule is feasible and replaces the current schedule 
with one less employee. The heuristic is terminated when 
no two employees can be replaced by one while maintain- 



ing feasibility. A flow chart of the rounding heuristic is 
given in Figure 3. 

Updating lower and upper bounds 

The lower bound on the objective value and the upper 
bounds on the decision variables are updated if the round- 
ing heuristic can not locate an optimal solution in an 
iteration. Let Z LP be the objective value for the LP solution 
found in the current iteration. Then the total number of 
employees needed is at least LB = (Z LP }. Hence, the 
proposed algorithm adds the following constraint to 
Problem PI 

£** = LB (11) 

Note that since the numbers of relief and lunch breaks that 
need to be scheduled should be equal to the number of 
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Figure 3 Flowchart for the rounding heuristic used in the branch and cut algorithm. 
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employees, the following constraints should also be satis- 
fied 



E E v kJ = lb 

keKjeB\ k 



E E *V = lb 

keKj<zB2 k 



kzKjGBL k 



W k1 = LB 



(12) 
(13) 
(14) 



Using the objective value cut (11), upper bounds on the 
shift variables X k can be updated as follows. Suppose shift k 
covers SL* planning periods (without losing generality, 
t = 1, . . . . SL k ). Now consider the demand constraints (4) 
for the planning periods not covered by shift k; that is, the 
demand constraints for the periods t = (SL k 4- 1), . . . , n T , 
where n T is the number of periods in T. By aggregating 
these constraints, we obtain 

E E V9- E E u„ 

t=(SL k +\)j€K; j+k t=(SL k +Y) i€T\, 

- E ( E ^-0+ E 

- E E ^ > E 6/ d5) 

Note that if another shift, say j, and shift k have no common 
planning periods then the variables associated with shift j 
on the left side of (15) can be regrouped as 



SLjXj- E u Jt -2 z w it - E Vj 



teB\ 



tr z BLi 



t€B2j 



Using (5 to 7) and (16) can be simplified as 

{SLj-4)Xj 



(16) 



(17) 



where (SLj — 4) is the number of work periods for shift j, 
with four periods (out of SLj) reserved for relief and lunch 
breaks (for shifts with longer or more breaks, this is replaced 
by the total break duration for the shift). If, on the other 
hand, shifts j and k have some common planning periods, 
say A periods, then the variables associated with this shift 
in (15) can be regrouped as 

(SLj — A)Xj - £ U Jf - £ W jt - £ V jt (18) 



tzBV. 



ieBV 



reBl'j 



where B\j, BLj and B2j are the sets of planning periods in 
the break windows for the first relief, lunch and second 
relief breaks of shift j that are not in the shift span of A'. Note 
that (18) is always less than or equal to {(SLj - 4)Xj\ since 
this is the maximum number of planning periods the 
employees assigned to shift j are available for work. As in 
(18) ; if one considers a particular part v of shift / during a 
work day. the number of work periods in that part will be 



less than this quantity. Now using this together with (17), 
(15) can be rewritten as 

£ (ffl y -4)^> £ b t (19) 

jeKitfk t(SL k +\) 

Let 5L max = mdXj^ k {SLj - 4}. Then, (19) can be written 
as 



E 



SL mAX 



(20) 



Since Etejc-** = 15 by (11), a new upper bound on X k is 
obtained using (20) as follows 

H<6D k b t 



X t < LB - • 



(21) 



where D k is the set of periods not covered by shift k. Our 
experience with the problems solved in this study showed 
that (21) provides a tighter upper bound for X k than the 
initial upper bound obtained from (9-10). Occasionally, 
however (21) may not produce a tighter bound (for example, 
as LB is increased in the subsequent iterations of the branch 
and cut algorithm). In that case, the lower of these two 
bounds is taken as the upper bound. 

Branch and cut algorithm 

We now outline the proposed branch and cut algorithm for 
Problem PI. 

Step 0. Calculate and include the initial upper bounds UB^. 
using (9) and (10) for X k in Problem PI. 

Step L Solve the LP relaxation of Problem PI. Let the 
objective value for the optimal LP solution be Z LP . 
If the LP solution is integer, stop; an optimal 
solution is found. Otherwise, let LB = (Z LP > and 
apply the rounding heuristic. Let the objective 
value for the heuristic solution be OBJVAL. If 
OB JVAL = LB, stop; an optimal solution is found. 
Otherwise, record the solution and go to Step 2. 

Step 2. Add constraints (11-14) and update the upper 
bounds for the variables using LB and (21). 

Step J. Solve the LP relaxation of PI with the new cuts. If 
the LP solution is integer, stop; an optimal solution 
is found. Otherwise, apply the rounding heuristic to 
the new LP solution. If a solution with an objective 
value lower than OBJVAL is found, update the 
solution and OBJVAL. If OBJVAL = LB, stop; an 
optimal solution is found. Otherwise to go Step 4. 

Step 4. Apply branch and bound to determine if a feasible 
solution with an objective value of LB exists. If 
a solution with an objective value of LB is located, 
stop; an optimal solution is found. Otherwise, go to 
Step 5. 

Step 5. Set LB = LB+ I, and update the right hand side 
of the objective value cuts added in Step 2. Update 
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the upper bounds for the variables using LB and 
(21), and go to Step 3. 

The branch and cut algorithm stops when the LP solution 
is integer, a heuristic solution with OBJVAL equal to LB is 
found, or a feasible solution with an objective value of LB 
is found in Step 4. In all these three cases, the solution has 
an objective value equal to the lower bound and, therefore, 
is an optimal solution to Problem PL 

Computational results 

We studied the performance of the proposed branch and cut 
algorithm by solving two groups of problems containing a 
total of 90 cases. The branch and cut algorithm was coded 
in FORTRAN. This program generates and inputs problem 
data, calls various LTNDO subroutines for solving linear 
programming relaxation and branch and bound search, and 
applies the rounding heuristic. All test problems were 
solved in batch mode on a Sun Sparc 512 computer. 



Scheduling environment 

The scheduling environment chosen for this study involves 
a continuous 24 h work day (cyclical scheduling environ- 
ment) to test the proposed approach in large scheduling 
problems. Employee requirements are determined for 
15min planning periods (4 x 24h = 96 planning periods) 
indexed successively T = {1, . . . , 96}. Concerning the shift 
start times, we considered two groups of test problems while 
keeping all the other aspects of the scheduling environment 
the same: In the first group of problems, shifts are assumed 
to start on the hour and on the half hour, resulting in 48 shift 
start times in a work day. In order to test the proposed 
approach in even larger cases, in the second group of 
problems, shifts are assumed to start every 15 minutes, 
resulting in 96 shift start times in a work day. Duration of a 
shift is taken as 9 h including breaks. It is assumed that each 
employee is given one half-hour lunch break (two 15min 
periods) and two 15min relief breaks, one before the lunch 
break and one after. We specify the break windows for each 
shift with reference to ideal break times as follows: the ideal 
start time for the first relief break is specified as 2 h after the 
start of a shift, the ideal lunch break time is located in the 
middle of the work day after 4h of work (that is, after 4h 
plus 15min for the first relief break) and the ideal second 
relief break time is specified as 2h after the completion of 
the lunch break starting at the ideal lunch time. The break 
windows are formed by setting the earliest break start times 
half an hour earlier than the ideal break times. The lengths 
of break windows are varied to carry out the computational 
experiments with problems of varying difficulty level. The 
relief and lunch break windows considered allow 3-7 break 
start times (see Tables 1 and 3). We considered five different 
demand patterns, resulting in 60 problems in the first 



problem group and 30 problems in the second group. 
Three of the demand patterns considered were found in 
the literature and were either obtained directly from or 
resembling to real patterns from service systems: (i) the 
trimodal demand profile for telephone operators provided by 
Segal 13 with a minimum quarter hour demand of 2 and a 
maximum of 89 employees, (ii) the bimodal demand profile 
with a minimum quarter hour demand of 2 and a maximum 
of 9 obtained from the General Telephone and Electronics 
Company of California and given in Henderson and Berry, 15 
and (iii) the unimodal demand profile given by Buffa 17 with 
a minimum quarter-hour demand of 2 and a maximum of 27 
that was obtained from the General Telephone Company of 
California. In addition to these three demand patterns, we 
considered two synthetically generated patterns; a bimodal 
and a trimodal sinusoidal demand patterns with an average 
quarter-hour demand of 27.5. Minimum and maximum 
quarter-hour demand across these two demand patterns 
were 5 and 50, respectively. Like in Henderson and 
Berry, 15 the cost of assigning employees to shifts is assumed 
to be the same and equal to one. 



Computational results with the test problems 

The number of variables, number of constraints, number of 
nonzero coefficients in the A-matrix and its density for 
Problem PI and the set covering model are shown in Table 
1 for the first group of problems and in Table 3 for the 
second group of problems. As seen in Tables 1 and 3, the 
implicit formulation requires substantially smaller, between 
73.3-93.6% i ess> num ber of variables than the equivalent 
set covering formulation. The problems characteristics 
shown in Tables 1 and 3 for the set covering approach 
exceed the limits of most hardware/software configurations 
including the one used in this study. Hence, solving these 
test problems optimally with the set covering approach was 
not possible. 

The minimum, average and maximum solution times for 
the first group of problems are shown in Table 2. Also 
reported in this table are the minimum, average and 
maximum LP solution times and the heuristic search 
times within the branch and cut algorithm. Our preliminary 
test runs showed that break variables U h , W h and V h tend 
to be integer when the associated shift variable X k has an 
integer value. Therefore, a branching strategy giving higher 
priority to shift variables X k was implemented in the branch 
and bound search with LINDO (step 4 of the proposed 
algorithm). When necessary, we made two attempts to solve 
a problem with two different branching strategies. In the first 
attempt, the shift variables were selected for branching 
according to their start times. We set the CPU time limit 
to 500 s. When the proposed branch and cut algorithm could 
not locate an optimal solution in the first attempt within this 
time limit, a second attempt was made by selecting shift 



612 Journal of the Operational Research Society Vol. 49, No. 6 
a 



Table 1 Set covering vs proposed model: problem characteristics for the first group of problems 



Number of % nonzeros in 

Number of variables constraints Number of nonzeros in A-matrix A-matrix 







set 


Proposed 


% reduction 


Set 


Proposed 


Set 


Proposed 


% reduction 


Set 


Proposed 


n \k ~ n 2k 


n Lk 


covering 


model 




covering 


model 


covering 


model 




covering 


model 


3 


5 


2160 


576 


73.3 


96 


240 


60120 


3168 


95.4 


33.3 


2.3 


4 


4 


3072 


624 


79.7 


96 


240 


98304 


3216 


96.7 


33.3 


2.1 


4 


5 


3840 


672 


82.5 


96 


240 


122880 


3360 


97.3 


33.3 


2.1 


4 


6 


4608 


720 


84.4 


96 


240 


147456 


3504 


97.6 


33.3 


2.0 


5 


3 


3600 


672 


81.4 


96 


240 


115200 


3264 


97.2 


33.3 


2.0 


5 


4 


4800 


720 


85.0 


96 


240 


153600 


3408 


97.8 


33.3 


2.0 


5 


5 


6000 


768 


87.2 


96 


240 


192000 


3552 


98.2 


33.3 


1.9 


5 


6 


7200 


816 


88.7 


96 


240 


230400 


3696 


98.4 


33.3 


1.9 


6 


5 


8640 


864 


90.0 


96 


240 


276480 


3744 


98.6 


33.3 


1.8 


6 


6 


10368 


912 


91.2 


• 96 


240 


331776 


3888 


98.8 


33.3 


1.8 


7 


6 


12096 


1008 


91.7 


96 


240 


387072 


4080 


98.9 


33.3 


1.7 


7 


7 


16464 


1056 


93.6 


96 


240 


526848 


4224 


99.2 


33.3 


1.7 



variables for branching according to the total demand 
covered by the associated shifts. When there is a tie, the 
shift variable with the earliest shift start time is given higher 
priority. If a second attempt was needed, the solution time 
reported for that problem is the total time for the two 
attempts. Hence, the CPU time allowed for two attempts 
was 1000 s. 

With the branch and cut algorithm, out of 60 problems 
considered in the first group, 54 were solved optimally in 
the first attempt and 4 more in the second attempt. In only 2 
problems out of 60, one involving n ik — n 2k = 6 and 
= 5 and the other n ik = n lk — 5 and n Lk = 3, an optimal 
solution was not located after two attempts within the time 
limit. In these two cases, however, the best feasible schedule 
found by the branch and cut algorithm (with the rounding 
heuristic) required only one more employee than the best 
lower bound. Thus, in these two cases, the best solution 
obtained with the branch and cut algorithm was the closest 



possible solution ('best nonoptimaP solution) to the optimal 
solution in terms of the number of employees needed. 

The rounding heuristic used in the branch and cut 
algorithm is useful for a number of reasons. First, it is 
very efficient and quite often locates an optimal solution; it 
located an optimal solution in 28 (out of 60) of the 
problems considered in this group. When this is not the 
case, this heuristic provides very good feasible solutions 
and tight upper bounds. In fact in all but one of the 
remaining 32 cases, the best heuristic solution required 
only one more employee than the best lower bound. Thus, 
in the worst case, the heuristic solutions obtained in these 
cases were the best nonoptimal solutions possible. 

We analysed the performance of the branch and cut 
algorithm further by plotting a histogram of the total 
solution times (Figure 4). As seen from this histogram, in 
70% of the problems (that is, 42 out of 60 problems), an 
optimal solution was obtained within 40 CPUs, and in 



Table 2 LP solution, heuristic search and total solution times with the branch and cut algorithm (CPU s) — first group of problems 



Total solution time with 

Break LP solution time Heuristic search time the branch and cut algorithm 
Window ! 



Combination 


n \k = n 2k 




Min 


Average 


Max 


Min 


Average 


Max 


Min 


Average 


Max 


1 


3 


5 


2.90 


6.42 


12.96 


* 


0.28 


0.98 


3.23 


14.60 


47.61 


2 


4 


4 


2.76 


8.02 


22.95 


* 


0.72 


2.05 


2.90 


119.33 


522.31 


3 


4 


5 


3.51 


8.41 


18.30 


1.07 


1.66 


2.49 


17.44 


45.89 


99.94 


4 


4 


6 


2.96 


4.85 


7.49 


0.64 


1.69 


2.38 


14.71 


84.90 


185.29 


5 


5 


3 


2.90 


10.76 


25.62 


0.36 


1.41 


2.03 


21.08 


299.09 


747.72 


6 


5 


4 


3.27 


7.38 


18.61 


0.77 


1.35 


2.59 


10.45 


• 136.54 


497.04 


7 


5 


5 


3.94 


9.28 


15.66 


1.08 


1.62 


2.73 


13.84 


30.17 


37.89 


8 


5 


6 


2.31 


4.30 


9.05 


0.02 


1.23 


2.30 


2.53 


116.15 


504.08 


9 


6 


5 


5.39 


10.21 


16.75 


0.04 


1.32 


2.67 


7.24 


18.19 


29.35 


10 


6 


6 


3.59 


5.16 


7.92 


0.04 


0.94 


2.98 


3.59 


33.78 


130.40 


11 


7 


6 


2.77 


5.95 


13.49 


0.04 


0.61 


1.90 


3.92 


137.11 


439.30 


12 


7 


7 


1.97 


5.35 


7.72 


0.04 


0.45 


1.02 


2.01 


117.93 


571.80 



*<0.01 s. 
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Table 3 Set covering vs proposed model: problem characteristics for the large problems 

Number of % nonzeros in 

Number of variables constraints Number of nonzeros in A-matrix A-matrix 
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Set 
covering 


Proposed 
model 


Set 
covering 


Proposed 
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Proposed 
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5 


5 


12000 
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87.2 


96 


384 


384000 


7104 


98.2 


33.3 


1.9 


5 


6 


14200 


1632 


88.7 


96 


384 


460800 


7392 


98.4 


33.3 


1.9 


6 


5 


17280 


1728 


90.0 


96 


384 


552960 


7488 


98.6 


33.3 


1.8 


6 


6 


20736 


1824 


91.2 


96 


384 


663552 


7776 


98.8 


33.3 


1.8 


7 


6 


24192 


2016 


91.7 


96 


384 


774144 


8160 


98.9 


33.3 


1.7 


7 


7 


32928 


2112 


93.6 


96 


384 


1053696 


8448 


99.2 


33.3 


1.7 



about 78% (47 out of 60), within 100 CPUs. In only about 
22% (13 out of 60) of the problems considered, the time 
needed exceeded 100 CPUs. 

In the second group, we considered 30 large shift 
scheduling problems involving 96 shift start times and the 
largest 6 break window combinations from Table 1. The 
branch and cut algorithm performed remarkably well in 
these large problems; 24 out of 30 problems were solved 
optimally in the first attempt, and 3 more in the second 
attempt. Only in three cases an optimal solution was not 
located. However, the best solution obtained with the 
rounding heuristic in these three cases, required again one 
more employee than the best lower bound. Thus, in the 
worst case, the 'best nonoptimaF solutions were obtained. 
Concerning the overall performance of the rounding heur- 
istic, we observed that it located an optimal solution in 14 



out of 30 problems. And in the remaining 16 cases, the 
rounding heuristic located the 'best nonoptimar solutions. 
The minimum, average and maximum LP solution times, 
heuristic search times and the total solution times with the 
branch and cut algorithm for the second group of problems 
are shown in Table 4. For this group of problems, we set the 
CPU time limit to 750 CPUs. Note that the solution time 
does not seem to be increasing with the problem size, 
although the times reported are somewhat higher compared 
to the first group of problems (Table 2). This is expected 
since the size of the problems in the second group is 
substantially larger (containing as many as 32 928 shift 
variations) than those in the first group. The distribution of 
the total solution times was as follows; 1 1 problems (37%) 
required less than 40 CPUs and 18 cases (60%) less than 
100 CPUs. 



14 
12 
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Table 4 LP solution, heuristic search and total solution times with the branch and cut algorithm (CPU s) — large problems 



Total solution time with 

LP solution time Heuristic search time the branch and cut algorithm 



n \k = n ik n Lk Min Average Max Min Average Max Min Average Max 



5 


5 


9.29 


26.08 


68.93 


0.02 


5.96 


10.30 


25.02 


235.91 


871.26 


5 


6 


7.04 


31.40 


65.32 


0.02 


5.12 


10.64 


7.09 


118.31 


372.00 


6 


5 


5.83 


23.15 


38.55 


0.05 


3.95 


7.48 


5.97 


93.13 


253.58 


6 


6 


7.59 


32.91 


62.28 


0.02 


3.50 


9.43 


7.61 


380.22 


1403.61 


7 


6 


8.81 


21.08 


59.97 


0.04 


2.18 


10.16 


8.99 


167.02 


437.27 


7 


7 


5.10 


19.69 


31.73 


0.42 


4.36 


9.68 


6.55 


228.49 


888.23 



Conclusions 

In this paper, we presented a new branch and cut algorithm 
for optimal shift scheduling. We tested the new approach 
by solving two groups of test problems involving a total of 
90 cases. The problems considered involved as many as 
32 928 shift variations. These problems, to the best of our 
knowledge, are the largest solved optimally in the litera- 
ture. Our results showed that with the implicit formulation 
and the branch and cut algorithm, large shift scheduling 
problems can be solved optimally. Our results also showed 
that the rounding heuristic used in the branch and cut 
algorithm often locates an optimal solution frequently. 
And when this is not the case, it is most likely to locate 
the 'best nonoptimal' solution. The proposed approach may 
be extended in a number of directions. An important 
extension would involve its application to the tour schedul- 
ing problem. The tour scheduling problem involves both 
daily shift scheduling for the work days in a week and days- 
off scheduling. Another area in which more work needs to 
be done is the development of decision support systems 
offering various scheduling and modelling flexibilities. 
This would be particularly important for the service orga- 
nizations such as telephone operator centres and telemar- 
keting centres where the workforce may account up to 60% 
or more of total operating expenses. 



Appendix 

Let H be the set of shift types with different wage rates. 
There may be several shift start times associated with each 
shift type in //. Let set K again contain all shift type-shift 
start time combinations. We are assuming that all employ- 
ees assigned to a particular shift type in //are paid the same 
wage c h ,h eH. An initial lower bound on the objective 
value in this case is obtained as 



LB 0 = 



in EfgT 



(Al) 



where c min = min Ae// {c A } 



The initial upper bounds calculated using (9) and (10) are 
independent of c /r Hence, they are valid in this case with 
unequal shift assignment costs. In Step 1 of the branch and 



cut algorithm, again the LP relaxation of PI is solved and a 
lower bound from the optimal LP solution is obtained. But 
this time we set LB = Z LP . If the solution is integer, then the 
algorithm terminates with the optimal shift schedule. Other- 
wise, the following integer program is solved in order to 
obtain a lower bound on the objective value in this case. 



Problem P2 



subject to 



min E^/, 



£c,r A ^LB 1+ <5 
Y h ^ 0 and integer for h e H, 



(A2) 



(A3) 



where Y h is an integer variable defined as the number of 
employees assigned to shift type h,h e H. In the first 
iteration of the branch and cut algorithm, d is set to zero 
in Step 1. And in the subsequent iterations, d = 
m * n fti.A2e//{l c /ii ~ c h2\}- Tn e objective value of an integer 
optimal solution to P2 provides a lower bound LB on the 
value of objective function (3) since the demand constraints 
(4) and the break constraints (5-7) are not taken into 
consideration. Note that when all shift assignment costs 
are equal and set to c h = I , h € //, solution of Problem P2 
can be easily determined as described in Section 3. With the 
lower bound LB obtained from the optimal solution of 
Problem P2, the following objective value cuts are added 
to Problem PI. 



E c k X k = LB 

keK 



E E c k U v = LB 



E E c k^ V kj = LB 
keKjeBL k 



E E c k V kj = LQ 

k<=KjzB2 t 



(A4) 
(A5) 
(A6) 

(A 7) 
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(AlO) 



To update upper bounds for the variables X k in this case, 
from (19) we obtain 

c min E (SLj-Wj^c^Zb, (A8) 

where c min is as defined before. But since c, > c min , j € K, 
£ cXSL - 4)Xj ^ c min £ b, (A9) 

And since SZ. max SL,, € K, we obtain 

£ Cj Xj> — -7 

Now combining (AlO) with (A4), the following upper 
bound is obtained. 

LB^C^E^V^max (An) 

If an optimal solution to PI is not found in an iteration of 
the branch and cut algorithm, the lower bound LBj is 
updated in Step 5 by solving Problem P2. The new lower 
bound is then used to update both constraints (A4-A7) and 
the upper bounds. 
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IMPLICIT MODELING OF FLEXIBLE BREAK ASSIGNMENTS 



IN OPTIMAL SH1F 



SCHEDULING * 



STEPHEN E. BECHTOLD and LARRY W. JACOBS 
Florida State University, College of Business. Tallahassee. Florida 32306-1402 
\ortfnrn Illinois University. College of business. DeKalb, Winds 601 i 5 

The labor scheduling literature has demonstrated that the use of flexibility in designing employee 
schedules can result in a substantial improvement in labor utilization. This paper presents a new 
implicit integer linear programming formulation for the inclusion of meal /rest-break flexibility. 
Although the use of flexible break assignments in labor starring decisions has been of research 
interest since an early article by Segal ( 1 974 ), due to problem size, the majority of related research 
has involved the use of heuristics. An experimental analysis using four different labor requirements 
patterns and ten shift-length combinations demonstrated that, when flexible break assignments 
were modeled, the implicit formulation was superior to the traditional set-covering formulation 
with respect to 1) execution time. 2) computer memory requirements, and 3) the ability to 
produce optimal integer solutions to larger problems incorporating greater flexibility. Finally, a 
number of possible extensions of the implicit modeling approach for use in other labor scheduling 
environments are identified. 

(SERVICE OPERATIONS; LABOR SCHEDULING: INTEGER PROGRAMMING) 

]. Introduction 

Service organizations frequently consider a large number of alternative work shifts in 
the development of labor schedules. These alternatives are typically developed through 
the use of variations in starting times, finish times, and the placement of breaks. While 
the existence of a large number of shift alternatives increases the complexity of the sched- 
uling task, it also offers an opportunity to improve labor utilization (Bvrne and Potts 
1973; Showalter and Mabert 1988). 

This paper is focused upon the problem of obtaining optimal integer solutions in shift 
scheduling environments where a high degree of scheduling flexibility is desired. Specif- 
ically, we introduce a new implicit integer programming approach to modeling break- 
placement flexibility. 

Integer programming approaches to the shift scheduling problem have traditionally 
been based upon the set-covering formulation originally proposed by Dantzie ( 1954). If 
break placement is not modeled, solutions can be efficiently obtained with a network 
flow approach (Segal 1974). However, Thompson (1988) has concluded that it is im- 
portant, relative to the objective of minimizing the amount of labor scheduled, to model 
at least some subset of the allowable break options within the optimization model. 

Unfortunately, the use of flexible break assignments can dramatically increase the 
number of integer variables required in the set-covering formulation. For example, in a 
telephone operator shift scheduling environment, over 98 percent of the 7,120 shift vari- 
ables associated with allowed shifts were required as a direct result of modeling flexible 
break assignments (Henderson and Bern 1976). The set-covering formulation has gen- 
erally been found to be intractable with respect to obtaining optimal integer solutions in 
such environments. Thus, given the expected benefits of scheduling flexibility, organi- 
zations have tended to prefer heuristic solutions to large problems over optimal solutions 
to small problems (Glover, McMillan, and Glover 1984; Glover and McMillan 1986: 
Holloran and Byrn 1986; Taylor and Huxley 1989). 

* Accepted by Stephen C. Graves; received September 23. 1988. This paper his been with the authors 5 
months for 1 revision. 
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A limited number of research papers have appeared in the literature which have direct!! 
addressed the size related problems associated with modeling of flexible break assignments 
Segal (1974) developed a three-phased heuristic which used the solutions of network 
flow models as the basis for an iterative break assignment procedure. Moondra ( I97 6 > 
attempted to provide some degree of flexibility, without increasing the number of problem 
variables, by assuming that half of the employees in a given shift would take their break 
in one planning period with the remainder of the employees taking their break in th* 
immediately following planning period. Keith (1979) developed linear programming 
relaxation heuristics where only one of the allowed break assignment combinations wW 
initially included in the model. Adjustments in the actual break assignments were mad, 
during the heuristic phase of the procedure. ~ 

Gaballa and Pearce (1979) developed an integer programming formulation which" 
modeled break-placement flexibility by including a separate break variable for everv 
period for which a break is allowed for each shift. Although the model incorporated a 
unique concept, the associated number of integer variables was greater than the number 
which would have been required by an equivalent set-covering model. 

The development of the new implicit modeling approach was based upon the realization 
that it was likely that substantial reductions in model size could only be achieved by 
reducing the information requirements of the model. A large reduction in the information 
requirements was achieved in the new modeling approach through implicit representation 
of break assignments for all shifts. Specifically, this was accomplished bv associating 
break variables with planning periods as opposed to shifts. A particular break variable 
represents the total number of employees starting a break in its associated planning ' 
period. Thus, break assignments are not made until the optimal solution to the implicit ' 
model has been obtained. 

The new implicit model eliminates the major disadvantage in Gaballa and Pearce'* 
(1979). Moreover, the definition of break variables in the new model as the number of ? 
breaks initiated in appropriate planning periods permits the modeling of breaks with * 
durations greater than or equal to the length of the planning period modeled. 

In the following section, we define critical terms, state our initial modeling assumptions * 
and present the new implicit model. §3 presents the results of two experimental analyses 
which were used to compare the implicit and set-covering approaches to shift scheduling 
The first provides a comparison of problem sizes and execution times for the two ap- 
proaches across a variety of operating conditions. The second demonstrates the robustness 
of the implicit modeling approach with respect to its capability to generate optimal so- 
lutions under conditions which violate a key assumption. §4 summarizes the experimental 
results and identifies a number of useful extensions of the implicit modeling approach. 

2. Modeling 

In this section, we develop a new model in which the assignment of a single break to 
each employee scheduled is implicit in lieu of its explicit representation in the traditional 
set-covenng formulation. Since a single break is being modeled, it mav be assumed to 
be a meal break. 

For purposes of comparison, we display the Dantzig set-covering formulation for shift 
scheduling in §2.1. §§2.2 and 2.3 provide the necessary foundation for the new model 
which is developed in §2.4. The break assignment algorithm is described in §2.5. 

2.1. Set-Covering Formulation 

The Dantzig set-covering formulation (PI ) for shift scheduling is typically expressed: 

minimize £ Cj yj, (l) 
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(2) 



where 

J = the set of indices associated with allowed shifts. 
I = the set of demand periods. 

\j = the number of employees assigned to the jth shift, 
c) = the cost of an employee assigned to the / th shift. 
1 if period / is a work period in shift j % 
0 otherwise, 
the labor requirement in period /. 
Note that formulation PI appropriately models applications where the break is one or 
more planning (demand) periods in duration. 



an = 



r, = t 



Definitions 



The following terms are essential for model development: 

( 1 ) Planning Period— The smallest time interval for which forecasts of labor require- 
ments have been established (typically 15 minutes to one hour). The use of the term 
"planning period" can be further clarified by noting that the operational day typically 
consists of a number of contiguous planning periods of equal duration. 

(2) Shift— The detailed specification of the planning periods to be allocated to work 
and rest respectively, 

(3) Work Span— The contiguous set of planning periods within which either work or 
rest must take place for a given shift. 

(4) Shift Length— The total number of planning periods assigned for breaks as well 
as work for a given shift. 

(5) Shift Type— The set of allowed shifts with the same work span. 

(6) Break Window— The set of contiguous planning periods within which a break 
may begin for one or more work spans. 

( 7 ) Extraordinary Overlap— The existence of the following condition: The initial and 
final periods in the break window for any one shift type are earlier and later respectively, 
than the corresponding initial and final periods in the break window for at least one other 
shift type. 

Note that a shift type is created by association of one or more break windows with a 
specific work span. 

2.3. Initial Assumptions 

We begin the development of the new model w ith the following initial assumptions: 

( 1 ) The organization operates less than 24 hours per day. 

(2) All planning periods are of equal length. 

(3) Even employee will receive exactly one break. 

(4) The duration of breaks is identical for all shift types modeled. 

(5 ) The duration of breaks is one or more planning periods. 

(6) Each shift type is defined by the assignment of a single break window to an as- 
sociated work span. 

( 7) The break window for each shift type consists of any selected nonzero subset of 
periods subject to the restriction that all periods contained within the break interval must 
be a subset of the work span associated w ith the shift type. 

(8) Extraordinary overlap does not exist. 

(9) No understaffing is allowed. 
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These assumptions do not present a severe limitation upon the more general application ^ 
of an implicit modeling approach for a number of reasons. Numerous service organization^ 
exist w hich do not operate on a continuous basis Assumptions!^ *j 

# 'JSgEBoratcd in the vast majority of published appl ications and resea7dT in~E 
' N scheduling . " " * : . 

v_ Assumption 8 must hold in order to euarantee that the resulting pl acement of break* - i H 
A Wl " bc Wlthin prescrib ed break window foralTshifts. However, this assumption M H 
not appear to present a serious issue for three reasons. First, we demonstrate in a later ^ ^ 
section of this paper that the implicit model appears to be capable of generating thelP-? 
desired optimal solutions even when a substantial degree of extraordinary overlap exists 'fflL 
Second, extraordinary overlap cannot exist unless a break window for one shift is at least 4$*§ 
two periods longer than the break window for a second shift. Finally, the existence dfffil 
this latter condition does not necessarily result in extraordinary overlap. 6^^^ 
^ As opposed to being restrict i ve , assumptions 5, 6, and 7 collectively allow for consid ^^l 
/ erable flexibili ty in the duration and placement of breaks Rpbv.ti.nc ^r^ Mmptionj ; 3 

and 9 can be modeled with only minor modifications in the model to be presented hitf&J ' 
the next section. « 



2.4. Implicit Formulation ( P2 ) 
The implicit formulation (P2), is 

minimize 2 CjX j9 

s.t. SXi-JOi^r, for all /EP, 

Z Fk - Z G k > 0 for all but the last element ArGN, (5)'l|lp 
Z Rk - 2 S k > 0 for all but the first element k G M, 

a> EX, A,GB>0, integer, 

where 

I A = the sum of all elements contained in any set A, 
T = the set of indices associated with allowed shift types, 

Cj = the labor cost associated with one employee working the jth shift type, JS 
Xj = the number of employees scheduled to work the /th shift tvpe 

x.-U > |#ew J } i r'-m 

Wj = the work span associated with the >th shift type, *v* 
X={A>|yGT}, 

fli = the set of break variables for which period / is a break period. 

n = labor requirement in period /, ^ 
P = the set of planning periods in the operating day, 

M = the set of initial (earliest) periods (ascending order) in the break windows as- 
sociated with all shift types. 

N = the set of final (latest) periods (ascending order) in the break windows associated 
with all shift types, 

p = the earliest period in the set of periods within which the break for anv shift may * * 
begin. < 

q = the latest period in the set of periods within which the break for any shift may t 
begin, < 

bi - the total number of breaks initiated at the start of period / by the complete set of 
employees from all shift types. 
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Vj - the set of periods associated with the break window for the yth shift type, 



h\ = { bi\k £ N and /=/>./»+ I •• - k } , 
R k = { h k | A" E M and / = k. k + 1 - . .<y|, 

S k - {Ai|UjSR k } f 

P b = {/>,/> + + 2- • -q- \. q\. 
Constraint set 4 ensures that demand requirements are met. Constraint sets 5-7 can be 
thought of as control constraints which are included to ensure that the shifts implicitly 
represented in formulation P2 are the explicitly represented shifts in formulation PI. 
Specifically, constraint set 5 is a "forward pass" set which ensures that sufficient quantities 
of breaks are appropriately available for allocation to subsets of shift types with break 
windows which are fully contained within the successively larger intervals (/?,/? + 1 , ... , 
k) with k E N. The function of constraint set 6, the "backward pass" set, is similar to 
that of constraint set 5 except that the successively larger intervals are (k, k + K • • -q) 
with k E M. Note that the intervals for constraint set 5 always include period p while 
the intervals associated with constraint set 6 always include period q. Taken together, 
constraint sets 5 and 6 ensure that each of the shifts represented in the optimal solution 
can be assigned a break which is within its prescribed break window. Constraint 7 ensures 
that exactly one break is available for each employee scheduled. 

A proof of the equivalence of PI and P2 is contained in Bechtold and Jacobs ( 1989) 
which was based upon the assumptions in §2.3. In the context of this paper, the term 
equivalence is used to indicate that the two models represent the same set of allowed 
shifts and that the optimal objective function values for the two models are equal. 

Defining A as the number of elements in any set A, the numbers of constraints (NC) 
and variables (NV) in PI and P2 are 



NC(P1) = />, 
NV(P1)= 2 V h 

NC(P2) = P+A/ + A'- 1, 
NV(P2) = T+B. 



(8) 
(9) 

(10) 
(ID 



(10) provides further insight into the development of the control constraints. Specif- 
ically, a forward pass constraint is included for all periods in N except the latest period. 
A backward pass constraint is included for all periods in M except the earliest period. 
Thus, there are N - 1 forward pass constraints, M - 1 backward pass constraints, and 
1 equality constraint for a total of M + N - 1 control constraints. 

In a general sense, the actual number of required control constraints is dependent 
upon the structure which results from the interaction effects of: ( I ) allowed shift lengths, 
(2) placement of shifts, (3) break window design, and (4) placement of break windows 
within shifts. However, if a shift is allowed to start in any period in which its length does 
not result in scheduled work beyond the latest period in the operational day and all break 
windows are contiguous, the total number of control constraints in P2 is 



NCC(P2) = 2(P-D)+ I, 



(12) 



where D = the duration of the shortest allowed shift length. Thus, when equation (12) 
applies for a given operational day, the number of control constraints is determined solely 
by the duration of the shortest allowed shift length. This implies that the number of 
required control constraints is unaffected by the number of different shift lengths modeled. 
If break flexibility is not being modeled, then NC(P2) > NC(Pl) and NV(P2) 
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> N V( PI ), and there is no general advantage in using P2. However the relative advantaged 
in using formulation P2 increases dramatically with increases in VjJE. Vj and increases^ 
in T. In many scheduling applications. 7 is a function of both P and the number of -If 'f* 
allowed shifts. Specifically, shifts arc typically allowed to begin in all planning periods *^V$<"1 
except for those which would result in scheduled work beyond the end of the operating Jl|f^ 
day. If we assume that this latter condition holds, and define a = the number of different 
shift lengths allowed and l k = the length of the k\h shift, then T = Zj-i (P - 1* + 1 ). 

The equality constraint in P2 may be exploited whenever the variable density of any |f 
constraint in set 5 or 6 is greater than half of the total variables (T+ B). Such constraints ^ 
in set 5 can be replaced with their corresponding complementary constraints « *\ 

Similarly, high density constraints in set 6 may be replaced with their correspondiiig 7 .^?Cj 
complementary constraints v if ' v - 




2 F k - 2 G k < 0. 



(14) 



2.5. £rewA' Allocation with P2 

The computational advantage inherent in P2 is conceptually achieved by requiring 
the model to produce less than the complete set of information required for the deter-^^^ 
mination of actual work schedules and planned break assignments associated with the^Pf^ 
optimal work force. Specifically, the breaks determined by P2 (b* , / = p to q) must be Iffl '$ 
assigned to the employees represented by shift type variables xf J = 1 to T. A simple $g 
and efficient procedure for accomplishing this allocation is: >| 

( 1 ) Set bi = bf for / = p to <?, and xj = jc* for j = \ Xo T. s 0 | 

(2) Set / = /?-!. 

(3) Set i = i a + 1. 

(4) If b, r = 0 and / < <?, go to 3. If ft, = 0 and / = <7, stop. Otherwise, go to 5. 

(5) Define Z = {xj\Xj > 0 and / E Vj} with elements * y arranged in nondecreasing 
order according to the criterion of the latest period within Vj. Ties are broken arbitrarily. 

(6) Define x) = the lowest ranked element of Z. l f 

(7) Set a = min { />,, x)} , 6, = A, - a, and .vj = a, - a. 

(8) IfA, = 0, go to 3. If a*} = 0, go to 6. 
If P2 modeling assumptions are met. all such assignments are guaranteed to be within 
the domain associated with the shift decision variables represented in the corresponding . ^ | 
formulation PI. 



3. Experimental Analysis 

This section provides a preliminary comparison of formulations PI and P2 in a hy- 
pothetical work environment which is described in §3.1. In §3.2 we examine the CPU 
execution times of PI and P2 which were associated with the solution of 40 test problems 
based upon all combinations of four labor requirements (demand) patterns and ten sets 
of allowed shift lengths. §3.3 provides a comparison of problem sizes and memory re- 
quirements for PI and P2 across the same 40 test problems. §3.4 presents the results of 
a preliminary analysis of the equivalence of P2 schedule assignments to those allowed 
by PI when assumption 8 was violated. 

All computer analyses included in this section were executed on a CDC CYBER 850 
computer. All formulations of PI and P2 were executed using the GOMORY cutting 
plane option in MPOS (Cohen and Stein 1978). Complementary constraints, as defined 
by equations ( 13) and ( 14), were used in the execution of P2 whenever they resulted in 
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lOW ™t'? alriX denS ' ty - ThC bre3k alloca,ion Procedure described in §2.5 was programmed 
in FORTRAN 77 and executed for each of the 40 MPOS solutions obtained for for- 
mulation P2. 

3. 1 . Operational Environment 

For purposes of analysis, we consider the task of shift scheduling for a hypothetical 
organization which has previously determined the labor requirements (demand) for an 
operational day composed of 24 consecutive planning periods. The actual length of a 
planning period may be assumed to be any convenient duration which is less than 
one hour. 

We consider four different labor requirements patterns in the subsequent analyses All 
demand patterns were set with a mean demand of 25 persons /period with a total demand 
of 600 across the 24 planning periods. Minimum and maximum demands across the 
four patterns were 5 and 45 respectively. The first pattern (Demand 1 ) was unimodal 
across the work day with lowest labor requirements in the initial and final periods The 
remaining patterns (Demand 2 through Demand 4) were all sinusoidal with one, two 
and three planning period(s) with peak requirements respectively. 

The selection of shift-length combinations to generate relative performance charac- 
teristics for the two modeling approaches was based upon the use of reasonable minimum 
and maximum shift lengths while simultaneously varying the size of the test problems. 
The measure of problem size used in this selection process was the required number of 
set-covenng variables. The ten specific shift-length combinations used are presented in 
Table 1 in ascending order with respect to the required number of set-covering variables. 
The total number of different shifts represented by each of the ten shift-length combi- 
nations was based upon the assumption that a shift of any allowed length could begin 
in any period which did not result in scheduled work beyond the end of the opera- 
tional day. 

All shifts included a one-period break. As in Byrne and Potts ( 1 973 ) the break window 
lengths were selected such that no break would occur in the first or last period of the 
work spans used. This resulted in respective meal break windows consisting of 7 and 8 
periods for the 9 and 10 period work spans. In order to obtain increases in problem size 
with increasing numbers of allowed shift lengths, break placement for all shifts was pre- 
scribed by: ( 1 ) centering a seven-period break window on work spans with odd durations, 
and (2) centering an eight-period break window on work spans with even durations! 
Although these break windows allow for a relatively high degree of break-placement 
flexibility, the overall scheduling flexibility represented is relatively low with shift-length 
combination 1 and relatively high with shift-length combination 10. 

3.2. Comparison of CPU Times 

Matrix generation, input, and output (M/I/O) times for PI and P2 on the ten shift- 
length combinations are displayed in Table 1 . For a given model, these times were found 
to be identical (within hundredths of a CPU second ) across the four labor requirements 
patterns. Model PI exceeded the memory capacity of the computer installation for the 
last six shift-length combinations. Therefore, the associated output times for PI are not 
displayed in Table 1. 

An examination of Table 1 indicates that PI required more matrix generation, input, 
and output time than P2 for every test problem for which comparative data was obtained. 
Based upon the raw data, the minimum and maximum PI / P2 ratios across all comparable 
pairs of matrix generation, input, and output times were 2.20 and 4.63 respectively In 
general, the times required by both PI and P2 were an increasing function of the required 
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TABLE I 

Matrix Generation and Input I Output Times for P| and?V 



Shift-Length 
Combination 



Available Shift 
Lengths 



Number Matrix 

of Generation Input Time Output Total Time 

Time(M ) 0) Tirne(O) (M/l/O) 



Variables 



P\ 



PI 



P\ 



n 



n 


P\ 


Fl 


P\ 


Ln 


.27 


.11 


.05 


2.27 


.86 


.32 


.13 


.05 


2.45 


.98 


.33 


.15 


.06 


3.10 


1.02 


.37' 


.19 


.06 


3.74 


1.11 


.39 




.07 




1.18 


.45 




.07 




1.33 


.58 




.07 




1.67 


.59 




.09 




1.71 


.76 




.10 




2.17 


1.28 




.18 




3.55 



1 


15, 17 


126 


34 


1.32 


.54 


2 


13, 15 


154 


40 


1.41 


.61 


3 


15. 17. 19 


168 


40 


1.79 


.63 


4 


15, 17. 19.21 


196 


44 


2.15 


.68 


5 


13, 15, 17 


2!0 


48 


1.98 


7-) 


6 


11, 13. 15 


252 


56 


2.07 


.81 


7 


11, U 15, 17. 19 


350 


70 


3.09 


1.02 


8 


9, 11, 13, 15 


364 


74 


2.66 


1.03 


9 


9, II, 13. 15, 17. 
19,21 


490 


92 


4.03 


1.31 


10 


9, 10, IK 12. 13, 
14, 15. 16. 17. 
18, 19, 20, 21 


970 


152 


7.93 


2.09 



.84 
.91 
1.16 
1.40 
1.30 
1.38 
2.12 
1.83 
2.82 

5.93 



1 All times reported are CPU seconds on a Cyber 850 computer. 
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Table 2 displays the mean, minimum, and maximum integer programming solution 
(S) times for PI and P2 across the four labor requirements patterns. The solution time 
measures for P2 included the CPU time required for break allocation. The break allocation 



TABLE 2 

Integer Programming Solution Time and Total CPU Time for PI and P2 l 



Minimum Maximum Mean Total 

Solution Solution Time 

Tim c Time (M/l/S/O) 

PI P2 PI P2 PI p? 



Shift-Length Available Shift 
Combination Lengths 



1 


15, 17 


2 


13, 15 


3 


15, 17, 19 


4 


15, 17, 19,21 


5 


13, 15, 17 


6 


11, 13, 15 


7 


11, 13, 15, 17, 19 


8 


9. 11, 13, 15 


9 


9, 11, 13, 15, 17, 




19, 21 


10 


9, 10, II. 12, 13, 




14, 15. 16. 17. 




18, 19, 20,21 



Mean 
Solution 
Time (S) 



PI P2 



.86 


.17 


.39 


.11 


2.20 


.74 


.81 


.17 


1.23 


.22 


.42 


.11 


1.59 


.26 


.53 


.14 


2 


.84 




.19 




.81 




.27 




1.22 




.34 




1.58 




.59 




2.33 




.81 




3.54 




1.56 



1.71 


.30 


3.13 


1.03 


5.04 


2.23 


4.65 


1.72 


2.46 


.35 


4.33 


1.24 


3.07 


.39 


5.33 


1.37 




2.60 




2.02 




2.22 




2.14 




3.72 




2.89 




3.94 




3.29 




5.97 




4.50 




7.63 




7.09 



1 All limes reported are CPU seconds on a Cyber 850 computer. 

2 Problem size for PI for the last six shift-length combinations exceeded memory limits, and no solution 
times were available. 
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y limits, and no solution 



times ranged from a minimum of .003 seconds for shift-length combination I to a max- 
imum of .141 seconds for shift-length combination 10. Based upon the raw data, the 
mean of the ratio of break allocation time to total P2 solution time was approximately 
1.4 percent. 

Again, in even' instance where comparative data was available, the solution times for 
PI were always greater than the corresponding times for P2 across the three time measures 
displayed in Table 2. The P1/P2 solution time ratios across the four labor requirements 
patterns for the four comparable shift-length combinations ranged from a minimum of 
2.24 to a maximum of 7.26. The mean solution times appeared to be increasing more 
rapidly for PI than for P2 with increases in problem size. Given the lack of solution time 
data for PI on the last six test problems, the support for this observation was not as 
strong as it was for the corresponding observation with respect to Table 1. 

Table 3 displays a summary of PI / P2 time ratios for: ( 1 ) total M/I/O time. (2) mean 
solution times, and ( 3) the sum of M/I/O and mean solutions times. These ratios provide 
further support for the increasing advantage of using P2 with larger problems. Table 3 
also indicates that the mean solution time ratios were always larger than the ratios for 
total M/I/O time. 

3.3. Comparison of Problem Size Characteristics 

Table 4 displays the number of constraints, the number of nonzero elements in the 
A-matrix, and the corresponding percentages of nonzero elements for both models across 
the ten shift-length combinations. For convenience, the total number of variables is also 
reproduced in Table 4. 

Table 4 indicates that PI always required more variables than P2. Moreover, a sequential 
examination of the results for shift-length combination 1 through 10 reveals that the 
number of required variables increased more rapidly for PI than for P2. 

The assumptions made in the development of allowed shifts in this experimental anal- 
ysis match those which apply to equation (12). Therefore, the implicit model required 
a total of 3P - 2D + 1 constraints. The fact that the number of constraints in the implicit 
model is unaffected by the number of shift lengths modeled is evident for all shift-length 
combinations in Table 4. For example, the minimum shift length in combination 4 is 
15 periods and the total number of constraints is 43 (3*24 - 2* 15 + 1 ). 

Due to the common minimum shift length of 9 periods, the total number of constraints 
in the implicit model is the same for each of the last three shift-length combinations. 
Thus, while 3 1 control constraints are required to implicitly model a 9-period shift length, 
the 12 additional shift lengths in combination 10 can be modeled without a further 
increase in the number of constraints. 

The number of nonzero elements associated with P2 was always substantially less than 
the corresponding number associated with PL While the number of nonzero elements 



TABLE 3 



Shift-Length 
Combination 


Available 
Shift Lengths 


Ratio P1/P2 
(M/I/O Time) 


Ratio 
P1/P2 
(S Time) 


Ratio P1/P2 
(M/I/O/S Time) 


1 


15. 17 


2.64 


5.06 


3.04 


2 


13. 15 


2.50 


3.00 


2.70 


3 


15. 17, 19 


3.04 


5.60 


3.49 
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15. 17. 19. 21 


3.37 


6.12 


3.89 
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TABLE 4 

Problem Size Characteristics for PI and P2 



Shift-Length Available Shift 



Number of Percent 
Total Number of Nonzcroes in Nonzeroes in 

Variables Constraints A-Matrix A-Matrix 



Combination Lengths PI P2 PI P2 PI P2 PI 



p: 



1 ' 7 126 34 24 43 1876 516 62.0 3s 3 

2 13. 15 154 40 24 47 1988 618 53.8 3^9 

3 15, 17, 19 168 40 24 43 2632 654 65.3 380 

4 15,17,19,21 196 44 24 43 3192 750 67.9 39 6 
' 13, 15, 17 210 48 24 47 2884 794 57.2 35 \ 

6 11.13.15 252 56 24 51 2968 926 49.1 ^'4 

7 11.13,15.17.19 350 70 24 51 4620 1240 55.0 347 
* 9,11,13,15 364 74 24 55 3864 1260 44.2 310 
9 9.11,13.15,17. 490 92 24 55 6076 1670 51.7 330 

19,21 

10 9,10,11,12,13, 970 152 24 55 12236 2004 52,6 347 
14, 15, 16, 17, 
18, 19, 20.21 



increased for both PI and P2 from the first to last shift-length combination, the rate of 
growth in the number of nonzero elements for PI was approximately twice that of P2. 

Table 4 also reveals that even though the size of the A-matrix for P2 was always smaller 
than the corresponding matrix for PL the percentage of nonzero elements for P2 was 
always less than the corresponding percentage for PI. These percentages did not appear 
to be a function of differences in the number of set-covering variables. Overall, the nonzero 
density for PI averaged approximately 25 percentage points above that of P2. 

3.4. Experimental Analysis of the Impact of Extraordinary Break- Window Overlap 

This section summarizes the results of a preliminary investigation of the robustness 
of formulation P2 when assumption eight was relaxed. The two break-window structures 



TABLE 5 

Two Break-Window Structures Used to Investigate the Impact of Extraordinary Break- H indow Overlap 
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which were utilized are displayed in Table 5. The break windows in both structures were 
designed such that it was possible for extraordinary overlap to exist between any two 
shifts with lengths which differed by at least two periods. While both structures permit 
a high degree of extraordinary overlap, structure B allows a higher degree than 
structure A. 

Structures A and B were executed with MPOS for all four demand patterns with all 
ten shift-length combinations in Table 1. This resulted in 80 runs made to examine the 
effects of extraordinary overlap. The application of the break allocation program to the 
80 MPOS solutions resulted in the placement of all breaks within the limits set by the 
prescribed windows. 

4. Summary and Conclusions 

A new implicit integer programming formulation was developed in this paper which 
allows scheduling flexibility to be parsimoniously modeled in a shift scheduling environ- 
ment. In order to compare the size and performance of the two models, a preliminary 
experimental analysis was conducted using 40 test problems. Differences in scheduling 
flexibility across the test problems were generated by increasing the number of allowed 
shifts while holding the degree of break-placement flexibility constant. The test problems 
were based upon all combinations of four different labor requirements patterns with ten 
different sets of allowed shift lengths. Integer optimal solutions were obtained for all 40 
test problems using the implicit formulation with commercially available software. The 
set-covering model would not execute on any problem w hich required over 200 variables, 
and thus, integer optimal solutions were obtained for 16 test problems. 

Across all problems which resulted in comparable data, the implicit formulation re- 
quired less CPU time in each of four categories: ( 1 ) matrix generation (M) r (2) input 
to integer programming software (I), (3) integer programming solution (S), and (4) 
output (O). The P1/P2 time ratios for M/I/O ranged from 2.64 to 3.37 across the 16 
problems for which both models executed. The mean PI / P2 ratios for integer program- 
ming solution time across the same 16 problems ranged from 3.00 to 6.12. In general, 
total solution time for either model was an increasing function of the number of shift 
lengths modeled: the rate of increase was higher for the set-covering model than for the 
implicit model. 

The size of the A-matrix required for the implicit model was substantially smaller than 
the corresponding size for the set-covering model for all ten shift-length combinations. 
The nonzero A-matrix density for the set-covering model was always higher than that 
for the implicit model with an average difference of about 25 percentage points. 

In order to control the distribution of breaks, the implicit model requires more con- 
straints than the set-covering model. However, the number of control constraints in a 
typical application for a given operational day is determined solely by the duration of 
the shortest shift to be modeled. Any number of longer shift lengths which are compatible 
with the duration of the operational day may be modeled without an additional increase 
in the number of control constraints. Alternatively, when break-placement flexibility is 
modeled, the number of variables in the set-covering model tend to increase at a much 
faster rate for the set-covering model than for the implicit model. Thus, for one of the 
largest problems in the experimental analysis, the implicit model required 818 fewer 
variables at the expense of 3 1 additional constraints. 

A secondary experimental analysis of the impact of extraordinary break-window overlap 
across shifts was also conducted. The results rev ealed that the actual placement of breaks 
was always within the prescribed break window for all shifts for all 80 test problems. 
Thus, the implicit modeling approach appeared to be robust with respect to its capability 
to produce the desired optimal work schedules. 
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These latter results may be partially explained by the fact that the break window for 
one shift must be at least two periods longer than that for an alternative shift for extraor- 
dinary overlap to exist. While increasing the size of a break window may create extraor- 
dinary overlap, it also increases the number of break assignment alternatives across the 
overlapped shifts. Moreover, if extraordinary overlap does exist between two shifts, the 
break assignment procedure developed in this paper will assign a break to the shift with 
the shorter break window before doing so for the alternative shift. 

Numerous research opportunities exist for the extension of the implicit modeling a p- 
proach for use in other operating environments. For example, shift scheduling for te le- 
phone operators is typically done across 24 hour planning horizons. Moreover, sh ift 
scheduling environments exist in practice for which the organ izat ion desires to includ e 
optimal placement of rest breaks as we ll as meal breaks in shift as"signments ( Henderso n 
and Berry 1976 )7 " " " 

The extension of implicit modeling to tour scheduling environments would appear to 
be very worthwhile due to the fact that tour scheduling problems are generally quite large 
relative to shift scheduling problems. This is due, in part, to the fact that daily shift 
assignments are integral to the task of solving the tour problem which frequently has a 
planning horizon of a week or longer. Given the potential CPU time savings which could 
be expected along with reduced A-matrix size, it may be possible to obtain optimal 
solutions to large tour scheduling problems using an implicit modeling approach. 

All computational results reported in this paper were based upon the use of the all- 
integer cutting plane procedure available in MPOS. Further research using mixed-integer 
cutting plane and /or branch and bound algorithms should be conducted. The experi- 
mental analysis conducted here was limited in part by space considerations, and it may 
be worthwhile to investigate the performance characteristics of the two models across a 
.broader range of break-placement and shift-length flexibility. 

Finally, the availability of an implicit modeling approach should encourage more 
research into the potential benefits of scheduling flexibility. To date, only two studies 
have been published in this regard ( Bailey and Field 1985; Showalter and Mabert 1988 ). 
Due to problem size, the results of these two studies were based upon heuristic solution 
procedures. 1 

1 We wish to express our appreciation to the editors and the anonymous referees for their suggestions which 
resulted in a substantial improvement in the exposition of this paper. 
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INTEGRATED DAYS OFF AND SHIFT PERSONNEL 

SCHEDULING 

James Bailey 

Arizona State University, Tempe, AZ 85287, U.S.A. 

{Received for publication 12 March 1985) 

Abstract— The two levels of personnel scheduling, that is, determination of the days an em- 
ployee should work and determination of the time an employee should start each workday, are 
integrated. The integration allows constraints involving hourly fluctuations in demand and fixed 
work force size to interact. The objective is to minimize the cost of premium pay plus customer 
inconvenience due to understaffing. An optimal solution is attained via linear programming. In 
a realistic service system environment, the model outperformed an integrating heuristic bv 



INTRODUCTION 

In service organizations, output cannot be inventoried. The demand for service requires 
the effort of an employee promptly upon the customer's arrival. As a result, a significant 
problem in organizations such as telephone exchanges, restaurants, retail stores, hos- 
pitals, police, and fire departments is one of scheduling personnel to meet fluctuating 
customer demands. 

The personnel scheduling problem has traditionally been solved at two separate 
but related levels. The ' 4 shift-scheduling' ' problem is to identify the number of 8-hour 
shifts needed to satisfy fluctuating hourly demand over the course of the day. The 
"days-off ' problem is to determine the number of people needed to satisfy the daily 
demands while guaranteeing adequate days off. Clearly these problems are related. 
The solution to the ''shift-scheduling" problem is the input to the "days-ofT ' problem. 
In the reverse direction, staff size and overtime constraints in the "days-off ' problem 
limit the availability of people in the "shift-scheduling" problem. One attempt at com- 
bining these two levels of personnel scheduling is referred to as the "tour" problem 
where five 8-hour shifts with identical start times are scheduled into 168 hrs of fluc- 
tuating demand. It is the intent of this paper to present a more robust solution to the 
integrated personnel scheduling problem. The solution is formed as a decomposable 
linear program with unique exploitable properties. 



LITERATURE SURVEY OF PERSONNEL SCHEDULING 

Models of the "shift-scheduling" problem have appeared in a number of papers. 
Bakerfl] presents a detailed review of many of these techniques. Luce[2, V developed 
a heunstic to select shifts, one at a time, from a "working subset" of acceptable shift 
patterns. The selection criterion was based on the proportion of unsatisfied demand 
met by each candidate pattern. Henderson and Berry[4] presented three heuristics for 
identifying good "working subsets" from all possible shift patterns. With these ac- 
ceptable shifts, they presented heuristics based on L.P. and a neighborhood search 
technique for scheduling shifts. Segal[5] describes a network flow model for optimal 
shift selection. Keith[6] presented an L.P. formulation which minimized the cost of 
understaffing and overstaffing. Baker[l] also suggests L.P. formulations that allowed 
limited deviation from the demand function in both the understaffing and overstaffing 
directions. Koelling[7] presented a goal programming model to include secondary ob- 
jectives such as overstaffing and understaffing. Field[8] presented an L.P. model that 
considered flexible shifts of differing duration. 

The solution to the "days-off ' problem has also received considerable attention. 
A simple heunstic to select work day patterns with two consecutive days off was offered 
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by Tibrewala et al.[9]- MonroeflO] presented a heuristic which did not require en 
secutive days off. Rothstein[l 1] presented an L.P. formulation which maximized th 
number, of shifts with two consecutive days off but accepted shifts with two noncon 
secutive days off. This solution was constrained to exactly satisfy demand on each 
day. " 

McGinnis et al.[\2] presented heuristics to solve the "tour" problem. In his one 
phase heuristic, the intent was to schedule five 8-hour shifts with identical start times 
based on a simple variation of the Luce model mentioned above. These results are 
presented in greater detail because they are used later for comparison purposes Work 
patterns were scheduled, one at a time, by calculating a priority index for each feasible 
work pattern and choosing the pattern with the maximum index. This was continued 
until the total work force was scheduled. The index for a feasible schedule was given 



(1 - Sy/D u ) 2 

-(1 - S 0 /D u ) 2 
0 



if D a >Su 
if D u < S u 
if D t j = 0 



Pit = 2 Pv, 



where 



P k - Priority index for pattern k summed over all periods T* in which pattern k 

is working, 
Pij = Priority index of hour / on day j, 

Sij = Supply of people already scheduled to work in hour i of day j, and 
Dtj = Demand for people to work in hour i of day j, 

INTEGRATED MODEL FORMULATION 

The "days-off ' and "shift-scheduling" problems need to be integrated because 
their constraints interact. To understand this integration consider first the "shift-sched- 
uling" problem. Utilizing Dantzig's[13] set covering formulation, Baker[14] proposed 
the following simple model of this problem 



MinZ, = 2C*x*, 
k= i 



Ki 



S. T. 



2 A lk x ik > D u \ for / = 1, . . . L h 

k= 1 



x lk > 0 and integer, 
where / = Monday, Tuesday, . . . , Sunday, where: 



(1) 



(2) 



(3) 



x ik = number of people scheduled to work shift pattern k on day i. 
C lk = labor cost of pattern k on day /, 

A lk = an (Li x K t ) matrix where a tk = 1 if pattern k is working during period / 

and a ik = 0 otherwise, 
Du = demand for people during period / of day i\ 

Ki = number of feasible patterns on day i, and 

Li = number of periods to be scheduled on day /. 

The above model is constrained to meet or exceed the demand during each period. 
It could also have been constrained to a known work force w ( by introducing slack and 
surplus variables into each period and adding a new constraint that requires the number 
of shifts to equal the available people. The slack variable should also be included in 
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NotfSTAi UnCti ° n t0 ^ C ° Unt f ° r CUSt0mer in ^nvenience cost when understaffed 
Note that .f there IS no difference in labor cost between shift natter™ \TZ ♦ • ' 



MinZ, = X ottf 



/=i 



a:. 



S - T - 2 AtkXik - On + C/« = £>, 7 ; for / = 1, 

Ki 

2 *** = VV/, 
*=1 



where: 



*<* ^ 0 and integer, 



(4) 
(5) 

(6) 
(7) 



Ou - overstaffing surplus during period / of day f 
U it = understaffing slack, 

a, v = customer inconvenience cost per worker due to understaffing and 
w, = available supply of workers on day / ^starring, and 



MinZ = 2 «, 



(8) 
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where: 



(9) 
(10) 



Xj - the number of people working days-off pattern / 
Cj - cost of one person on pattern j, 

A " ' 5hZ™ ,"" ,ere "•" it PMm ' "■*«• « «ay * and 

D t - demand for workers on day /, and 
/ = number of feasible days-off work patterns 

causes lS?cS2S T^Z^^Z ** ^ W °* 
size of the work force. T^^^^^T » thiS e ^ the 
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where: 



Wtotaj = total size of the work force 

r, = customer inconvenience cost of being understaffed by one person on 

Ui = understaffing during day i, and 
Oi = overstaffing during day / 

can b° *o S'^SSSSiLS* 11,6 firS "' 0rliM ° f «* ■*«*• faction 

(•^iStss^ss r '; days - ofr ' 

by adding seven replications J7m UU^fS.^'V'^ tac " 0ns are c ° mt,i «1 

Mi ° z 'k c ' x '*ki^- (.5) 

ST - 2 ^i/ATy - 2 */* = 0; for i = I 7 
y-i *=i ,.../, 



(16) 



^ "-ikx-ik - Uik + u u - Da forl=l,... Li&adi= h 7> 



J 



total, 



(17) 



(18) 



where: 

W^totai = total size of the work force. 
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Fig. 1. Submatrices of the A matrix for the integrated model, 
matrix is sparse and its nonzero portions can be divided into 15 submatrices The An 

X the %l TT* ^ Xj P ° rti0n ° f eqnS (16) and ^ The " columns of A 
reflect the M feasible days-off pattern. Figure 1 , for example, identifies seven patterns 

of eqn (16) and the bottom row represents the 2 Xj in eqn (18). The A, through A 7 

S^^Si , ^ , 2 ^ POrti ° nS ° f (16) ' ^ese'are essentially iZSct 
^th a single row of negative ones. The A 8 through A„ submatrices represent eqn (17) 
The U rows represent the hourly periods of the day. The left hand columns ren 
resent all feasible shifts. For example, the five columns in the ^rO^iZ 

^ " t fr° Ur ^ day V The right hand 2 X L < C0lumns nt the overs affing 

and understating portions of eqn (17). Thus the A matrix has M + 7 x (K- + 2L ) 

columns and 8 + 7 x U rows. For Fig. 1, this represents a (92 x 210) matrix which 

can be handled even by microcomputer based linear programming software 

EXTENSION TO A 24-HOUR DAY 

Figure 1 demonstrates the formulation for a 12-hour work day The model can be 
adapted to work days of various lengths. However, when a 24-h^ 
sidered, a complication arises. A worker who is scheduled to work on two consecutive 

t y n S ta i°, U,d H 10 back - t0 " back ^ and not be given time for suffic n 

men a and physical rest. To guarantee a rest period, the model can be extended to 
Ouee overlapping 12-hour periods covering a 24-hour day. These periods are shown in 
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Midnight Grave-period Noon Midnight-period Midnight 

(— 1 

6 AM Day -period 6 PM 

Fig. 2. Traditional shift periods for flexishift models. 

^m^ni™^ the / easible work P a » e ^s in the A matrix to start and finish in th, 

INTEGER SOLUTIONS 
The formulation given in eqns (15-19) calls for integer solutions The use of li™ 
programing software would be much faster but ma -yield If act oS result ?t 
concern has been discussed by several authorsfl, 8 and 15] BaSdi It al 51 h 
scribed a s.mple procedure that yields optimal integer results provided each ml 2 f" 

SESEtSr EXPerienCe " teSting the integrated SJSSbS £ 

mand vectors and hnear programming software always resulted in integer solutions 

SHIFT ASSIGNMENT HEURISTIC 

SteP L 3 F ( r a°ld3(t) P " reSUlt CrCate 3 Shift requirements table as shown in Figs. 
Step 2. Select the smallest shift requirement with a start time nearest the middle 
of the possible start times. Break ties arbitrarily. In Fig. 3(a) there a e 38 
c shifts which must start at 8 am on Saturday 

Step 3. Idendfy the days-off patterns which require ihe shift selected in step two. 
c» a J" hlg - 3(b) there are shlft Patterns B, C, D and F 

Bi^-^hT aV3iIab,e SCleCt patteras with more ^ ^st. 

Step 5 A skn shift^rh ^ " % 3(b) ' ^ ia pattern F which « days. 

Mep 5. Assign shifts to the selected pattern using the following criteria: 
a. assign shifts selected in step 2 

b ' time? S Fit er ^ h ; ft fK fr0m dther thC eadieSt or the latest ava « ab * start 
tunes. In Fig. 3(a), these are 6 am and 10 am shifts. 

Sten 6 If jySXl ? r C requirements ta We to reflect the assignment. 
Mep 6. If all shifts are assigned, stop, Otherwise return to step 2 



COMPARISON OF MODELS 



To study the efficacy of the integrated L.P. model given bv eons fl5 m cnm- 

JS^^^^n^T^ etal [U] and a modifica «°n of the two phase 
of Tr davs as Jvfntv Lr [I] a " d u F,e,d [ 8 J- The test data were six weeks worth 

company ^7^1^^ based ° n ° perator re <J^ements for a telephone 
company. Week i of the data represented actual demand while subsemient weeks 
represented successively smoothed versions of the actual data SUbSeqUCnt 
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All formulations allowed for operators to work any five consecutive or six days 
in the week. All shifts were eight hrs long allowing for four feasible start times By 
summing the demand for each week and dividing by 40 hrs/week, a target number of 
employees required for each week was estimated. This target number was assumed to 
represent the size of the work force. Labor costs were set at one dollar/hr The cost 
of Saturdays, Sundays and a sixth overtime day were included using standard time and 
a half and double time considerations. The cost of customer inconvenience was some- 
what arbitrarily set at 2.5 times the standard hourly wage so as to make understaffing 
more costly than even the double time wage for Sundays. The measure of performance 
was the sum of the labor cost and the customer inconvenience cost as given by eqn 

Three scheduling heuristics were tested. The integrated model given by eqns (15- 
19) was the first method. A second heuristic was the one-phase method offered by 
McGinnis and presented earlier in this paper. Finally a two step L.P. heuristic was 
tested. All tests utilized the same demand data and feasible shift patterns 

The two phase L.P. formulation was taken from models represented by eqns (4- 
7) and eqns (11-14). First the days-off problem was solved to determine the labor 
avauable for each day. The cost of understaffing required by eqn (1 1) was set at eight 
times the hourly understaffing cost since each person worked eight hrs/day. Once The 
daily staff sizes were determined, seven shift-scheduling problems were solved The 
total cost of the solution was the sum of the labor cost from the days-off solution plus 
the understaffing cost from the seven shift-scheduling solutions 

The results of these tests are presented in Table 1. Comparisons are presented as 
total labor plus understaffing costs and as a percent savings of the integrated model 
over the two alternative solutions. On the average, the integrated model resulted in a 
cost savings of 13.56% of the solution given by the McGinnis one-phased method and 
3.76% of the solution given by the two-step L.P. model. One could attribute this slight 
improvement over the two step L.P. method to the integration of eight optimization 
problems into one. The more significant improvement over the McGinnis method came 
in part, from increased flexibility in shift selection. 

The McGinnis one phase heuristic was limited to work patterns that started the 
same time during each day. This resulted in 56 feasible shifts. When multiple starts 
have been allowed, as in the integrated L.P. model, there are 35,840 feasible shifts 
which would be very costly to search each time a new shift was to be scheduled The 
optimization nature of the integrated L.P. method must also account for some of the 
gain. 

Table 2 contains additional comparison statistics for week 1 data using the three 
approaches. Several observations are possible. First, the two L.P. methods did a better 
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Table 2. Comparison of three approaches using week-1 data and 116 employees. 

Two Step 

Integrated McGinnis L. P. 





periods of understaf f ing 


43 


25 
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periods of overstaffing 
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periods of perfect staffing 
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4. 


total understaf fing man/hours 
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total overstaffing man/hours 
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cost of labor 
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6832 


5336 
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cost of understaff ing 


1932 


1H02 


2073 


8. 


people working 6 day weeks 


18 


IliJ 


1 



formulation averaged .12 min. The McGinnis and two-phased L.P. solutions averaged 
.47 and .32 min of execution respectively on the IBM 3081 computer. The optimization 
nature of the integrated L.P. method must also account for some of the gain. To dem- 
onstrate the shift assignment heuristic, the week-1 solution is illustrated in Fig. 3. The 
output of the linear program is tabulated in 3(a) and 3(b). The heuristics assignments 
would occur as they are listed in 3(c). The resulting distribution of start time variations 
are given in 3(d). In this example the average start time variation was 1.2 hrs with 29 
patterns having the same start time for the entire week. Only five patterns had the 
maximum variation of 4 hrs. 



CONCLUSIONS 

This paper presented an integrated model of the "days-off" and "shift scheduling" 
problem. The model was in a decomposable linear programming form. The constraints 
in the days-off problem such as staff size, consecutive days off requirements, and 
overtime limits can be reflected in the shift scheduling problem. In addition, constraints 
in the shift scheduling problem such as hourly demand variations and changes in the 
cost of understaffing and overstaffing can be reflected in the days-off problem. A heu- 
ristic to allocate required shifts to the days-off pattern is presented. The intent of the 
heuristic is to reduce the occurrence of work patterns with the maximum difference in 
start times over the week. Compared to an existing heuristic and suggested two step 
optimization approaches, the integrated model consistently resulted in lower cost so- 
lutions. For further reading, the reader may refer to refs. [17-21]. 
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easy because the start and enT&^ slS^ — ^ tOUr SChedu,in S is 

weekend. But when dailv pattern Tvarv su/h T"""' ^ n0 WOrk takeS P ,ace 
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overall problem increases dramatic" ^ U '° rkf ° rCe ' th<? ° f the 
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1. Introduction 

A host of issues surround personnel scheduling, in- 
cluding the assessment of labor requirements, demand 
forecasting, service level determination, and workforce 
composihon. Once these issues are resolved, up to three 
mterrelated problems must be solved in order to staff a 
faahty. The first deals with the assignment of on and 
off days to employees (the days-off scheduling prob- 
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nr^wl T ^ COnCemed WUh Ch00sin 8 the hours 
o shifts that an individual will work (the shift sched- 
uling problem). The third considers daily and weekly 
schedules ,n an integrated fashion and tries to develop 

oZ °Tu S nnin8 h0ri2 ° n (the t0ur Ruling 
problem). If shifts are permitted to span more than a 

single 24-hour period, the scheduling problem is said 

to be continuous; otherwise it is called discontinuous. 
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For organizations with fixed daily shifts (e.g., a ma- 
" chine shop that operates from S-to-5, six days a week) 
the onlv problem is that of days-off scheduling to insure 
that each employee works exactly five days a week. For 
organizations with a fixed 5-day week (e.g., utility of- 
fices that close on weekends), the problem is one of 
scheduling the workforce for various shifts during the 
day to meet the expected work load. The integrative 
tour scheduling problem is relevant when management 
has the flexibility to define shifts and assign days off in 
a nonuniform manner. Hospital nurses, telephone op- 
erators, bus drivers, and airline crews are a few examples 
of labor groups where such prerogatives exist. In each 
of these instances, the underlying problem is compli- 
cated by the fact that the demand for service fluctuates 
markedly throughout the day and across the week, and 
that labor contracts often limit the set of available 
choices. 

The purpose of this paper is to present a comprehen- 
sive model of the discontinuous tour scheduling prob- 
lem and to show how it can be solved efficiently for 
real world instances. The model is general enough to 
allow for full and part-time employees with various shift 
lengths, different days-off patterns, flexible break al- 
locations within specified break windows, 4- and 5-day 
work weeks, assignments across labor categories, and 
restrictions on the ratio of full-time to part-time labor 
hours. In the development, we build on the work of 
Burns and Carter (1985) who effectively solved the 
days-off problem, and Bechtold and Jacobs ( 1990) who 
devised a clever way of markedly reducing the number 
of decision variables needed to model shift scheduling. 
The approach taken is hierarchical. We begin with a 
relaxed formulation and find optimal solutions to^the 
accompanying integer linear program. Subsequently, we 
postprocess the outpuT to arrive at hieh qualitv, although 
not necessarilv optimal, tours. 

In the next section, we present some background ma- 
terial and related research. This is followed in §3 where 
assumptions and notation are given along with the 
model. The solution methodology is highlighted in §4. 
An application concerning personnel scheduling at 
General Mail Facilities (GMFs) is presented in §5. These 
facilities typically employ between 200 to 300 clerks 
and machine operators. Individual machines may be 
scheduled to run up to 20 hours per day. Results from 



a full analysis of the Providence, Rhode Island GMF 
demonstrate the efficiency of the proposed methodol- 
ogy. We are able to solve problems nearly one order of 
magnitude larger than previously reported, usually 
within 10 minutes on an IBM 3081-D. 

2. Background and Related 
Research 

Depending on the situation, each of the three scheduling 
problems can be written as an integer linear program 
of the form: 

minimize Z = 2 c > x j O a ) 
/-i 

subject to: 

n 

Za )t Xi*r t > t = l,2 m (lb) 

ri 

Xj > 0 and integer, / = 1, 2, . . . , n (lc) 

where 

n respectively, the number of working-day patterns, 
daily shifts, or weekly tours included in the problem 
under consideration. 

m number of time periods in the planning horizon. 

Cj cost of the working-day pattern, daily shift, or 
weekly tour / . 

r, number of employees required to work in period f . 

Q jt equals 1 if period t is a work period for working- 
day pattern, daily shift, or weekly tour /; 0 otherwise. 

Xj number of employees assigned to working-day 
pattern, daily shift, or weekly tour /. 

The objective function (la) minimizes the total cost 
of the chosen schedule while constraint (lb) ensures 
that each period t is staffed to meet the requirements 
r, . In many applications c, is taken as one, implying that 
the working-day patterns, daily shifts, or weekly tours 
are of equal cost (Baker 1976). Of course, this would 
not be appropriate when overtime or variable shift 
lengths are being considered. The above formulation 
assumes that a set of working-day patterns, daily shifts, 
or weekly tours are available to select from. While such 
a set is usually small for the days-off scheduling prob- 
lem, it can be large for the other two. In such cases, it 
is common to define working subsets. 
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2.1. The Days-Off Scheduling Problem 

Because of its importance, this problem has been studied 
extensively over the last 20 years. Baker and Magazine 
( 1977) investigate a seven-days-a-week operation with 
a fixed weekday demand and a different fixed weekend 
demand. They further consider four possible modes of 
operation where employees are entitled to either: 
— two days off each week; 
— two consecutive days off each week; 
—two consecutive days off each week and four days 
off every two weeks; or 

—every other weekend off and two pairs of consecutive 
days off every two weeks. 

Lower bounds are presented for each of the above cases 
along with a series of algorithms for producing sched- 
ules capable of attaining these bounds, thus achieving 
optimality. 

Bartholdi and Ratliff ( 1978) introduce the concept of 
"unnetworks," which are problems whose defining 
matrices are "opposite" to networks in the sense that 
only two zeros (rather than two ones) appear in each 
column. They noticed that several days-off scheduling 
problems have unnetwork incidence matrices because 
of the fact that employees generally work five days a 
week and take two days off, and that each working day 
corresponds to a 1 and each off-day corresponds to a 0 
for the a it elements in problem (1). They exploit this 
structure by introducing a complementary problem to 
( 1 ) that seeks to maximize the number of workers that 
may be given two consecutive days off during the week. 

In a related work, Bartholdi et al. (1980) note that 
several days-off scheduling problems have "row cir- 
cular" matrices; that is, 0-1 matrices with the ones oc- 
curring consecutively, where the first and last row en- 
tries are considered consecutive. Two cases that belong 
to this category are scheduling with two days off each 
week, and scheduling with two pairs of consecutive days 
off every two weeks with one of these pairs a weekend. 
Through a change of variables, it is demonstrated that 
the matrix of a row circular problem can be transformed 
into a matrix with only two nonzero entries (either +1 
or -} ) in each but the last column y„. If y„ i s treated as 
a parameter the optimal objective function z(y„) is 
known to be piecewise convex, and hence a binary 
search algorithm can be used effectively to bracket the 
optimal solution. 

1126 



In a landmark paper, Burns and Carter (1985) con- 
sider the following general conditions: 

—demand r, (/ = i, 2 7 ) is variable each day; 

—each employee is given at least A out of every B 
weekends off; 

—each employee works 5 out of 7 days; and 
—each employee works at most 6 consecutive days; 
and formulate three lower bounds. What is most im- 
portant, though, is that they present a simple procedure 
that always attains the highest of the three bounds, and 
hence solves the problem optimally. Emmons and Bums 
(1991) extended the approach to include hierarchical 
labor categories, where workers in the higher categories 
can substitute for those in the lower ones but not vice 
versa, and where restrictions on the number of em- 
ployees in each category are stipulated. 

Bechtold (1988) addresses a variation of the days-off 
scheduling problems characterized by multiple locations 
and multiple objectives. In the model, employees at one 
location can be transferred to work at other locations 
on different days of the week. Objectives include min- 
imizing idle time, minimizing the total number of em- 
ployees, and minimizing the number of transfers during 
the week. His approach consists of finding the minimal 
workforce needed to meet the daily aggregate demand 
across all locations. This value serves as input to a series 
of models that are used to determine the actual assign- 
ments and transfers. 

2.2. The Shift Scheduling Problem 

This problem is concerned with assigning employees to 
shifts within a working day. Shift* can have different 
starting and endine times, different durations particu- 
larly when part-timers are allowed, and different break 
configurations for lunch and resting. This means tnat 
the Hire matrix structures encountered in the days- 
off scheduling problem are absent. Because of this dif- 
ficulty, heuristics have been used extensively. The two 
most common approaches are ( 1 ) LP roundoff and (2) 
decomposition, where some of the constraints are re- 
laxed to induce a special structure that makes the full 
problem easier to solve. Exact optimization approaches 
have also been used successfully to solve small to me- 
dium sized problems. 

Beginning with heuristics, Segal (1974) takes a de- 
composition approach. He addresses the continuous 
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scheduling problem of assigning telephone operators to 
daily shifts to satisfy projected demand. The day is di- 
vided into 15-minute periods, and initially the need for 
breaks is ignored. The resulting problem is modeled and 
solved as a pure network. The shifts chosen by the 
model often involve some excess capacity which is used, 
with the help of a second network model, to provide 
as many of the required breaks as possible. If at this 
point some breaks are still unsatisfied, extra employees 
are added. No information is reported on the quality of 
the solutions. 

Henderson and Berry (1976) look at an application 
that admits a very large number of shifts. They present 
two sets of heuristics: the first selects a working subset 
of shifts from the full range of possibilities, and the 
second performs the actual scheduling. The selection 
algorithms include a random shift generator and a max- 
imum difference heuristic which chooses shifts that dif- 
fer maximally from those shifts already contained in 
the working subset. The solution heuristics are based 
on an LP rounding scheme. The authors experiment 
with various working subset sizes in combination with 
the heuristics. Good, and often optimal, results are ob- 
tained for problems with up to 100 variables. 

Bartholdi (1981) considers continuous scheduling 
problems with column (and hence row) circular matrices 
containing "intermittent" ones. That is, the ones occur 
consecutively in each column except for some inter- 
mittent zeros. The first and last rows are assumed to be 
consecutive. These matrices are typical of shift sched- 
uling problems involving continuous shifts interspersed 
with breaks. In his study, Bartholdi devises an LP round- 
off heuristic whose accuracy depends on the instance 
size. 

Using a discontinuous model, Mabert (1979) ad- 
dresses the problem of scheduling check encoders in 
banks. The total volume of daily check arrivals is as- 
sumed to follow a normal distribution from day to day. 
However, the percentage of the checks arriving within 
any given hour is assumed to be fixed. As a consequence,/ 
model (1) now has a stochastic right hand side, r t . 
Chance-constrained programming is used to find so- 
lutions. 

Morris and Showalter ( 1983) present a cutting plane* 
algorithm for optimally solving problems with a rela- 
tively small number of identical shifts that may overlap 
from one day to the next. The objective function of the 
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LP relaxation is rounded up to the next higher integer 
to obtain an initial lower bound. This bound is intro- 
duced into the problem and the LP relaxation is solved 
again. If the new solution is integral, it is also optimal; 
if not, branch-and-bound is used to see if an integer 
solution exists with the lower bound as the objective 
value. If the analysis succeeds, the corresponding so- 
lution is optimal; otherwise the lower bound is increased 
by one, the cut is updated, and the procedure is re- 
peated. 

Finally, Bechtold and Jacobs (1990) present a new 
modeling approach in which shift types rather than 
shifts are the decision variables. A shift type is identified 
by a start time, a shift length, and a break window 
within which the break can start. For the discontinuous 
case, they demonstrate that an often big reduction in 
the number of decision variables can be achieved, but 
at the expense of a new set of constraints needed to 
insure that each eligible shift type receives a break. 

2.3. The Tour Scheduling Problem 

Because the days-off and shift scheduling problems are, 
in effect, special cases of the more general tour sched- 
uling problem, the above discussion applies here as well. 
Some researchers, though, have taken a holistic ap- 
proach. Ritzman et al. (1976) study the problem in a 
post office setting. They use data for a typical week 
broken down into 84 two-hour periods. Several rules 
for choosing tours are defined (for example, one rule 
says add the tour that results in the largest reduction 
of unprocessed mail). In the algorithm, tours are gen- 
erated, assigned a fixed weight reflecting the importance 
of the accompanying rule, and put on a candidate list. 
A biased random sampling scheme is used to select the 
next tour from the list. 

Mabert and Watts (1982) consider service-oriented 
industries that limit inventories. Using check encoders 
at banks as an example, they demonstrate that in this 
environment the number of weekly tours is quite large 
when overlapping shifts and part-time employees are 
allowed. Under certain operating conditions related to 
daily work stretches and fixed cut-off times for encoding 
all the checks, working subsets of tours are randomly 
generated using a biased sampling procedure. The 
probabilities for this procedure are based upon infor- 
mation gleaned from the solution of daily shift sched- 
uling volumes, as well as daily work loads. The analysis 
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consisted of a number of experiments aimed at com- 
paring two factors: ( 1 ) one of four stretch selection rules, 
and (2) different sizes for the working subset. 

Morris and Showalter (1983) use an LP roundoff 
heuristic based on working subsets of tours, and report 
that they were able to obtain a gap of less than one 
percent between the heuristic and LP solutions for 30 
test problems. Love and Hoey ( 1990) use a two-phase 
^decomposition scheme for a fast-food restaurant ap- 
plication but do not present experimental results. Li et 
al. ( 1991 ) consider employees that differ in productivity, 
hourly cost, number of available working hours per 
week, and different days-off constraints. In addition to 
LP rounding of tour variables, they develop two types 
of "constraint violation" heuristics that typically start 
with an infeasible solution and add new tours to achieve 
feasibility. A dropping heuristic is used to reduce 
overstaffing. For a study that highlights the benefits as- 
sociated with nonstandard shift and tour lengths, see 
Mabert and Showalter ( 1 990 ) . 

Loucks and Jacobs (1991) investigate a problem 
where the workers differ in time of availability and task 
qualifications. They present a goal programming heu- 
ristic that first minimizes overstaffing and then mini- 
mizes the deviation between the scheduled and desired 
hours for each of the employees. Their experimental 
design consisted of 24 test problems of moderate size. 
The solutions indicated that anywhere from 30 to 70 
workers were needed to satisfy demand. For a com- 
parison of many of the above approaches, see Bechtold 
etal. (1991). 

Bailey (1985) combines the seven daily shift sched- 
uling problems and the days-off selection problem in 
an integrated formulation which allows for under and 
overstaffing. His model is similar to ours. The output 
of the corresponding mixed integer program is postpro- 
cessed in an attempt to minimize the maximum differ- 
ence in shift start times for each of the employees. Re- 
sults are given for a problem restricted to full-time 
employees, four feasible start times, 8-hour shifts and/ 
40-hour weeks. 

Easton and Rossin ( 1991 ) also take an integrated ap- 
proach and develop a column generation algorithm for- 
constructing working subsets of tours. They begin with 
a heuristic that finds an initial feasible solution. A basis 
is then constructed whose dual variables are used in a 
dynamic program to generate new columns. The dy- 



namic program seeks to compute the reduced cost of 
each feasible tour in order to find the most attractive 
entering column for each employee category. Two sets 
of test problems were examined. The first was charac- 
terized by full-time employees working 9-hour shifts, 
five consecutive days a week with a one-hour break in 
the middle of the shift. The second allowed for part- 
time employees working 4-hour shifts but without 
breaks. In both cases, shifts were required to start at the 
same time every day. Results for various demand pro- 
files yielded near-optimal solutions in reasonable CPU 
time. 

3. Model Development 

In this section, a rather general formulation of the dis- 
continuous tour scheduling problem is presented. The 
goal is to find the minimum cost pool of employees 
needed to meet the forecasted labor requirements during 
each period of the day for each day of the week. We 
start with a preliminary set of conditions, but later show 
how to modify the formulation to handle more complex 
situations. 
Assumptions 

1. The day is divided into 48 ^ -hour periods. 

2. Labor requirements are specified for each of the 
336 periods in the week. 

3. Both full-time and part-time shifts are allowed on 
any day. 

^ 4. A shift must be contained entirely within one dav 
(to accommodate this requirement, a "day" can be con- 
veniently defined as any 24-hour cycle; we use 7:00 am 
as the starting point). 

5. Each full-time employee is entitled to two days 
off per week (not necessarily consecutive) and is as- 
signed a shift of 8 f hours during each of the remaining 
days. 

6. Multiple shift lengths are allowed for each part- 
time employee. 

7. Each shift that exceeds a prescribed duration is 
entitled to a break of one time period which can be 
assigned anytime du ring a prespecified break window. 
ghe break window consists of a set of contiguous plan- 
ping periods. 

1T No two shift types exist such that the break win- 
dow of one is a strict subset of the other. (Note, this is 
f equivalent to the absence of extraordinar y overlay de- 
\fined by Bechtrld and Jacobs.) 
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9. The ratio of full-time to part-time labor hours 
should be no less than p, a lower limit. 

In most operational settings, the classical formulation 
( 1 ) of the problem resulting from these assumptions 
requires over a billion variables. As discussed in §2, 
working subsets of tours are usually generated to reduce 
this number to a manageable level (Henderson and 
Berry 1976, Mabert and Watts 1982, Morris and Show- 
alter 1983). The obvious shor tcomin g r>f enr^ fln fl p 
proach is that attrartivP solution^ m ^y ho /nr^rlfM+H 

in the reduction process. As an alternative, we develop 
an approach based on the work of Bechtold and Jacobs 
on shift scheduling, and Burns and Carter on days-off 
scheduling that implicitly considers all possible tours 
by constructing the tours after rather than before the 
problem is solved. 

3.1. Implicit Modeling of Breaks 
Rather than defining shifts with specified breaks, 
Bechtold and Jacobs simply consider shift types as vari- 
ables, where a shift type is identified by a start time, a 
shift length, and a break window within which the break 
must begin. This leads ?o a big reduction in the number 
of decision variables since a single variable is used to 
describe all shifts of the same type regardless of when 
the break begins. 

In order to describe the approach, the following no- 
tation is needed: 

t index for time period 

j index for shift type; j ' = 1, . . . , n 

Xj decision variable for the number of employees as- 
signed to shift type / 

fit decision variable for the total number of breaks 
initiated at time / for all shift types 

M set of initial periods, ranked in ascending order, 
for the break windows associated with all shift types 

N set of final periods, ranked in ascending order, for 
the break windows associated with all shift types 

p the earliest period a break can begin for any of the 
shift types 

q the latest period a break can begin for any of the 
shift types 

F k { j: break window for shift type ; lies between time 
periods p and k } 

B k { j: break window for shift type ; lies between time 
periods k and q } . 
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Under rather nonrestrictive assumptions paralleling 
those stated at the beginning of this section, Bechtold 
and Jacobs show that each shift will receive a break 
within its window if and only if the following conditions 
are satisfied: 

k 

Z ft ~ Z *,£0, Vt6N\j,j (2a) 
2 * ; *0, V*6M\{p} (2b) 

n q 

2 x, - 2 a = o. (2c) 

1-1 t=p 

Equation (2a) may be thought of as a forward pass con- 
straint which ensures that the sum of the breaks initiated 
during any time span beginning at period p is more 
than or equal to the total number of shift types with 
break windows lying completely within the time span. 
Similarly, (2b) is a backward pass constraint which 
guarantees that the sum of the breaks initiated during 
any time span ending at period q is more than or equal 
to the total number of shift types with break windows 
lying completely within the time span. These two in- 
equalities along with the balance equation (2c) ensure 
that each shift will receive a break within its window. 
Note that the reduction in the number of decision vari- 
ables with respect to ( 1 ) comes at the expense of the 
above constraints. 

3.2. Exact Lower Bounds for Number of Tours 

Burns and Carter derive lower bounds for the number 
of employees needed to staff a single-shift organization 
under several rather general days-off restrictions (each 
employee gets A out of every B weekends off, each em- 
ployee works 5 days each week, and no employee works 
more than 6 days in a row). They further show that at 
least one of the bounds is exact for any instance of the 
problem. This is done using a constructive proof where 
a stipulated algorithm is shown to be capable of achiev- 
ing the highest of the lower bounds. 

In this paper we focus on the case where the only 
days-off restriction is that each full-time employee re- 
ceives two off-days each week. Hence, the correspond- 
ing days-off scheduling problem can be thought of as 
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a special case of the one addressed bv Burns and Carter 
Nonetheless, since the constructive algorithm needs to 
be modified to suit our problem, it is necessary to show 
that the bounds are still valid. 

Consider a days-off scheduling problem with daily 

requirements U (d = 1 7) , and where each em 

ployee works 5 out of 7 days. Two obvious lower bounds 
on the minimum number of employees needed (a,) are: 



so the minimum number of employees * needed is 
max[U, 12], or 3. The initial surpluses for the seven 
days (Step 1) are: 



Requirement, 
Surplus, S d 



1 

2 



where fx! is the smallest integer 



greater than or equal to x, and 

L2 = maxfr,: d = 1 7}. 

The first bound 11 follows from the fact that each em- 
ployee works only 5 days (which implies that 5o> 
2: r d ). The second bound is self-evident. (Burns 
and Carter's third lower bound is not applicable here ) 
Thus we have, 



o> = maxfLl, L2] 



(3) 



which gives rise to the following algorithm for con- 
structing the days-off assignments once u is determined 
from (3): 

Step 1: Initialize S d = o> - r d ; d = 1 7. 

Step 2: Choose any day k such that S» = max{S„- d 

= 1 7}. PutS* *-S k - 1. 

Step 3: Choose any other day i # k, such that S, > 0 
If S d = 0 for all if * k, set / = it. Put S, — S, - 1 
Step 4: Repeat Steps 2 and 3 w times. 
Step 5: If pairs of the form (k, k) axe created, swap 
one element of each such pair with an element of any 
pair ( 1, j) , where i # * and j # it. 
The values of S,in Step 1 represent the surplus number 
of employees above the daily requirements and are to 
be used for assigning off-days. Steps 2 and 3 choose 
the first and second off-days respectively and decrement 
the surpluses of the chosen days; at this point nondis- 
tmct off-days are allowed. After u repetitions of Steps 
2 and 3, Step 4 resolves nondistinct off-day pairs 
through swapping. ' 

A simple example is given to demonstrate the algo- 
rithm. Consider the case where the labor requirements 
for days 1 (Monday) through 7 (Sunday) are: 1, 2 
1- 2, 2, 2, 1, respectively. LI = f^l = 3 and L2 = 2 



Steps 2 and 3 are applied 3 times. In the first pass the 
chosen off-days can be (among other possibilities) the 
pair ( 1, 7). The surplus vector now becomes: 
Surplus S, I121111 

Similarly two other off-day pairs may be chosen as 
(3, 4) and (6, 7), thus completely determining the 
weekly days-off schedules for the three employees It 
is worth noting that in order to schedule as many week- 
ends off as possible, Step 2 can be preceded by sched- 
uling the weekend off for „ - r employees and reducing 
the surpluses for Saturday (day 6) and Sunday (day 7) 
accordingly, where r = min { r 6 , r 7 } . This will not in- 
crease w. The validity of the algorithm is now shown. 

Proposition 1. The above algorithm generates valid 
days-off assignments using w employees, where a, is found 
from equation (3). 

PROOF. By definition, « * r d , d = 1 7, so all 

slacks generated in Step 1 are nonnegative. In Steps 2 
and 3 the existence of days k and i are guaranteed by 
lower bound LI. That is, ZJ., Sd = 7 W - Z2-, r d * 2« 
so the cumulative surplus can provide two days (which 
may be identical) for each employee. It remains to show 
that if pairs of the form (k, k) are generated, enough 
pairs (i, /) exist to permit the swaps in Step 4, where i 
* k and j + k. Notice that the algorithm starts assigning 
pairs of the form (*, k) only when S d = 0, all d A, S k 
> U and one or more employees must be scheduled Let 
e denote the number of remaining unscheduled em- 
ployees (these employees will be assigned off-days of 
the form (k, k)), and let c denote the number of sched- 
uled employees that have day k as one of their off-da v S 
From lower bound LI, the surplus for day Ut this Point 
is at least 2e; that is, 

S, ; - (a> - r k ) - c 2: 2e. And since r t a 0, we get 
(l '~c^2f,orw-c-esc. 
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The left-hand side of the last expression is simplv the 
number of scheduled employees not having it as one of 
their off-days; i.e., employees with off-days of the form 
(/, / ), where i ± k, and / * k. Hence, the proof is com- 
plete. □ 

3.3 The Personnel Scheduling Model 

The model that we developed for minimizing the cost 
of staffing a facility over a seven day planning horizon 
combines (1), (2), and (3) with appropriate modifi- 
cations and side constraints. The following notation is 
used in the presentation. 

Indices 

d index for the day of the week; d = 1, 7 

/" index for shift type 
k, t index for time period 

Parameters 

c 1 prorated weekly cost of a full-time employee 
c] cost of part-time shift type ; 
/„ 1 if full-time shift type / covers period f ; 0 otherwise 
Pit 1 if part-time shift type / covers period f; 0 oth- 
erwise 

p d earliest period a break can begin for any of the 
permissible shift types on day d 

q d latest period a break can begin for any of the per- 
missible shift types on day d 

7, the length of shift type / excluding breaks (hours) 

n f number of full-time shift types 

n p number of part-time shift types 

r d number of periods considered for scheduling on 
day d 

r dl required number of employees on day d, period t 
p full-time to part-time labor ratio 

Sets 

Bl d { /: break window for full-time shift / on day d 
lies between k and q d } 

Fid { j: break window for full-time shift / on day d 
lies between p d and k } 

B p kd {j: break window for part-time shift / on day d 
*ies between k and q d } 

F p kd { /: break window for part-time shift / on day d 
lies between p d and k } 

F d set of all full-time shift types on day d that have 
breaks 



P d set of all part-time shift types on day d that have 
breaks 

M d the set of initial periods, ranked in ascending order, 
for the break windows associated with the shift types 
of day d 

N d set of final periods, ranked in ascending order, for 
the break windows associated with the shift types of 
day d 

Decision Variables 

ft/, total number of breaks initiated on day d in period 
t for all shift types 

x di number of employees assigned to full-time shift j 
on day d 

y dj number of employees assigned to part-time shift 
/ on day d 

o) number of full-time employees needed 

Model 

nf 7 

Minimize z = c f w + £ cf 2 yd, (4a) 

/-I d-1 

subject to: 

2 fit**! + 2 p.**, - ft,, ^ r dt/ (4b) 

d = 1 7; f = 1 r d 

k 

2 A*- 2 2 y d j^o, (4c) 

VArENA {q d } ; d = 1 7 

2 fa - 2 x* - 2 Vdj ^ 0, (4d) 
VfcEM,\{p,}; d=l,...,7 

la 

2 *d, + 2 V- - 2 fat = 0, d = 1 7 (4e) 

/gf* ;6P 4 '-w 

7 

40* > p 2 7 , 2 y dj (4f) 

j nf 7 

^rS2 (4g) 

2 x dit d=\ 7 (4h) 

x d }> Vj / ft- and u integer and nonnegative. (4i) 
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The objective function sums the full-time ( first term) 
and part-time (second term) labor costs. Here we assume 
that if the cost of a particular part-time shift type varies 
from day to day (a weekend shift may require time and 
a half), additional shift types differing only in their 
hourly costs have been included in the model. However 
if full-time shift types with the same length but different 
hourly costs exist, the formulation would have to be 
modified slightly (see discussion in §3.5 for enforcing 
same start times). 

Equation (4b) is a set covering type constraint similar 
to (lb) which stipulates that the sum of the full-time 
and part-time shifts covering period t on day d, less the 
number of breaks allocated in period /, should at least 
match the labor requirement during that period. Equa- 
tions (4c) through (4e) are the forward, backward and 
balance constraints for the seven days of the week used 
to ensure that each shift requiring a break receives one 
Constraint (4f) stipulates that the total full-time hours 
(left-hand side) should be greater than or equal to p 
the full-time to part-time minimum ratio, multiplied by 
the total part-time hours (right-hand side). Finally (4g) 
and (4h) place lower bounds on the number of full- 
time workers, «, needed to satisfy the various shift re- 
quirements during the week. 

Solving (4) yields the minimum workforce comple- 
ment needed to m*et the demand; however, it does not 
speatv the tours to be assigned to the full-time em- 
ployees nor does it enforce any restrictions on" the shift 
Mail times wirhlirrach-four: Hence, model (4) is a re- 
ax,Hon of the full problem implying. tha7the_task of 
to- construction must be done in a postnroceTsing 
phas* In addffion, v^MTProposmon 1 assures that t 
tours are sufficient to staff the full-time shifts, there is 
no guarantee that the fluctuation in the start times for 
each of the tours will be acceptable. This issue is taken 
up in the computations section where, for the problems 
investigated, we found that it was always possible to 
construct valid tours from the solution of (4) When 
uniform start times are called for, a minor adjustment 
in the formulation permits us to find individual tours 
whose start times are the same for each of the five 
working days. 

3.4. Postprocessors 

For a simple scheduling problem involving full-time 
employees only working 8] -hour shifts, 5 out of 7 days 



a week and a ±1 hour start window, it can be verified 
that the new formulation (4) has on the order of 10 ' 
decision variables, compared to (1) which has order 
10 . The major reason for this difference once again is 
that no tours are generated in advance. Rather, the ap- 
proach leaves the task of tour construction until after a 
solution has been found indicating the number of full- 
time and part-time shift types needed along with their 
start times. The concept of implicit modeling of breaks 
in turn, serves to further reduce the size of the problem' 
Nevertheless, in order to explicitly assign breaks to 
shifts, and to convert the daily shifts into weekly tours 
for the w full-time employees, the results of the model 
must be further processed. 

Postprocessor I: Assigning Breaks to Shifts 

Assigning breaks to shifts is a feasibility problem that 
can be easily solved using a transportation model for 
each day of the week, as shown in Figure 1 Here the 
nodes on the left represent different time periods during 
which breaks are initiated, with supplies equal to the 
number of breaks started at each period of day d . The 
latter are obtained from the solution of (4). The nodes 
on the right represent different periods during which 
shifts are started on day d, with demands equal to the 
number of shifts started at each period, similarly ob- 
tained from the solution of (4). An arc (/, /) appears in 
this bipartite graph only when shift type ; contains pe- 
nod i in its break window. Gaps in the node numbers 
imply the corresponding variables were found to be 
zero. 

Because we only need a feasible solution any method 
such as the northwest corner rule will suffice. In order 
to favor the assignment of breaks near the middle of 
shifts, one can minimize an objective function whose 
cost coefficients c, y are given a zero value if period i is 
at the middle of the break window of shift /, and a 
proportionally larger value the further period / gets away 
from the middle. 
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Postprocessor II: Assigning Shifts to Tours 

Although model (4) guarantees that u, full-time workers 
are sufficient to cover all full-time shifts, it does not 
provide an explicit assignment of shifts to weekly tours 
for each of these employees. Accordingly, an additional 
step is needed. In what follows, it is assumed that the 
constructed tours should have as low a variation as pos- 
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Figure 1 Postprocessor I— Partial Network for Assigning Breaks to 
Shifts on Day d 

Break Shift Start 

Time Periods Time Periods 




sible in the start times of their shifts throughout the 
week. This can be achieved using a simple algorithm 
that begins by constructing an initial set of tours and 
then attempts to improve their "quality" by shift swap- 
ping. The procedure is: 

Step 1. Rank the full-time shifts for each day of the 
week in ascending order of their start times. 

Step 2. Use the days-off algorithm in §3.2 to generate 
an initial set of a? tours, scheduling for each day the 
earlier shifts first. 

Step 3. Calculate the variance, of , for the start times 
of the shifts of all the tours i = 1, . . . , u>. 

Step 4. Find tour /* such that: /* = argmax[<r, ? : i 
= 1 n). 

Step 5. Consider all other tours with at least one shift 
starting earlier than the latest starting shift in tour /*, 
and another shift starring later than the earliest starting 
shift in tour i*. If none exists, no improvement is possible 
for i*; go to Step 6. 



5a. For each such tour /, identify the shift-swapping 
pattern with tour r* that will result in the least cumu- 
lative variance for the new tours (the best case might 
be no swapping for all of the shifts). 

5b. Identify tour /* that results in the smallest after- 
swap cumulative variance among all the tours y . Perform 
the swap and update the variances of the new tours. 

Step 6. Repeat Steps 4 and 5 for a predetermined 
number of iterations, or as long as variance reduction 
continues. 

A few points merit clarification. First, in an intuitive 
sense, the likelihood of finding tours with little variation 
in start times is high due to the fact that labor require- 
ments follow similar patterns from day to day (except 
perhaps on the weekends). Second, the condition in 
Step 5 on the earliest and latest times of shifts is nec- 
essary for having variance reduction when swapping 
shifts between tours /* and /, and hence, can be used 
to eliminate redundant computations. 

A number of possible variants of the above algorithm 
can be readily conceived. For example, Bailey (1985) 
attempts to minimize the maximum difference in start 
times over the week for all tours. As an extension to 
our basic approach, we adopted a procedure common 
to tabu search (Glover 1989): rather than terminating 
the algorithm when tour i* fails to improve, tour /* is 
temporarily placed "on hold" for a certain number of 
iterations while other tours of lower variance are con- 
sidered. 

Finally, if rj denotes the number of iterations after 
which the procedure is stopped, then the order of com- 
plexity of the algorithm is 0(2 7 t?u> 2 ) or 0(t)u 2 ). To see 
this, note that in each of the tj iterations, a> comparisons 
are performed in Step 4, up to w - 1 tours may be 
considered for swapping in Step 5, and that at most 2 7 
swap patterns are possible. 

3.5 Extensions to the Model 

In building the model, one of our primary concerns was 
to be able to address staffing issues that may not be a 
consideration now but may arise in the future. Changing 
the full-time to part-time labor ratio is one. This can be 
investigated by simply rerunning the model with dif- 
ferent values of p in equation (4f ). The two other issues 
that we examine are: (1 ) the impact of allowing 4-day 
weeks, and (2) the consequence of enforcing the same 
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daily start times for the full-time workers. These are 
slightly more complex and require modifications to the 
basic model. 

Allowing 4-Day Work Weeks 

In order to model this situation, let xj, be the decision 
variable associated with the 8^ -hour shifts of type j on 
day d for full-time employees working 5-day weeks, 
and let x\ be the decision variable associated with the 
10|-hour shifts of type / on day d for full-time em- 
ployees working 4-day weeks. Similarly, let n n and n f 2 
represent the number of full-time shift type for 5 -day 
and 4-day a week full-rime employees, respectively. Fi- 
nally, denote by u 1 and a> 2 the number of 5-day and 4- 
day full-time employees. For the new model, the set 
covering, forward pass, backward, and balance equa- 
tions may be reproduced as they were in (4), but equa- 
tions (4f) through (4h) need to be replaced by the fol- 
lowing: 

40(^+, 2 )>pV 7; 2 yrf/ 



1 



«' * I ' - i 7 

/•I 



1 



n' 2 7 



/=1 d=l 



n/2 



" 2 ^ 2 xj # d = 1 7. 

The validity of the lower bounds for the 4-day week 
can be shown in a manner analogous to that of the 5- 
day week previously discussed. The only remaining step 
is the replacement of the term c r w in objective function 
(4a) with c f (w l + a) 2 ). 

Enforcing Same Daily Start Times for Full-Time 
Workers 

Assume that v start time are permitted. Corresponding 

to each start time r (t = 1, v) define the variable 

X- T denoting the number of full-time shift types starting 
at time r on day d. Also, let a- 1 , . . . , w v denote the 
number of full-time employees starting their shifts at 
times 1, . . . , v. Equations (4b) through (4e) remain 



the same for the new model but with \ Jr replacing x d ,. 
while equations (4f) through (4h) are replaced by: 

40(* l + ... + v y, I y<, 

« * r Z X dT/ r = 1 v 

u T £ X dTI d = 1 7; r = 1 v. 

In addition, the original term c f w in the objective func- 
tion is replaced by c f (u> 1 + • • • + o> 1 '). If 4-day weeks 
are to be considered, then additional constraints would 
have to be added. 

The final point to be made here is that if we have 
shift types that only differ in their cost, the same mod- 
eling procedure can be used. This might be the case 
with evening or weekend shifts, or in situations where 
it is permissible to assign workers across labor categories; 
that is, where a skilled worker is assigned to a lower 
paving job category for one or more shifts but is still 
paid his higher rate. 

4. Solution Approach 

Research into solving large scale integer programs over 
the last few years has shown that good modeling is 
critical to the solution process. Nevertheless, it has been 
our experience that realistic instances of problem (4) 
cannot be tackled with commercial codes in a straight- 
forward manner. (Our attempts to solve (4) directly 
with OSL (IBM 1991 ) proved unsuccessful.) In light of 
this limitation, one of our main goals has been to de- 
velop a practical procedure that can be used by industry 
for both short-term planning and parametric analysis. 

4.1. Preliminaries 

Inspection of problem (4) reveals that it consists of seven 
shift scheduling subproblems with complicating side 
constraints (4f) through (4h); observe that (4f) and 
(4g) He the decision variables (x di , y di ) together over 
the week. This structure can be exploited by using a 
decomposition scheme that allows us to solve the 
smaller shift scheduling subproblems individually, 
rather than the whole problem at once. 

In order to implement this idea, the model has to be 
modified. Let Xj, be an integer variable denoting the 
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foul number of full-time shifts on day d Moreover 
consider ihe se, , of par ,., ime sh . ft > J"- «. 

> rf/ denote the total n.i™k^ ~r _ _ . 6 . . u iet 



engtn ' e (it is convenient at this point in ,h» 
Jength /. The following equations are added toTodel 



x l = 2 x 4j , d=i 7 



Y 'i = Z y*>, 

/SNf 



d = 1 7; /££.. 



Equations (5a) and (5b) can be thought of as sur 

workforce variables V/ v'„j * aggregate and 
■Me) values X ' f > an J 1' ^ <* 3t SOme ** <* ('ea- 
V * " d u dec omposes problem (4 ) 
^ seven independent shift scheduling subproWents 
with side constraints of the form (5a) and (L rZ 
*, the objective function (4a) become a con ftant^nd 
Zd eS K^ ) - (4h) ^ redu "dant. S c ^e 

constraints. ' *'> SatlS ' yin S ,he follo ^8 

Feasibility Subproblem for Day d 

+ Z f-i T- (6a) 

k 

£*'~&* , ~ l Zk,*'* 0 - V *ei^\{ fc } (6b) 

.?/ A '>5/ rf/ ~i f /^°' ^MA/,,} (6c) 



2 + Z y„, - 2 0« 

' eF < /ep„ „ Pd 

= Z i-i 
^« = Z y J( , /el 

/e.vf 



= 0 



(6d) 

(6e) 
(6f) 



y J; integer and nonnegative. ( 6 g ) 
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4 2. Decomposition Algorithm for Augmented 
Personnel Scheduling Problem 

The problem to be solved is model (4) augmented wirt, 
equations (5a) and (5b). The addition <rf£ ^ "wo 
constraints does not alter the feasible reg* n Z ~° 
£or .algorithm consists of a fruncated^branch and 
bound routine and a heuristic which is invoked when- 
ever an all-integer solution fails to materialize wiZ 

(5a) ^^r^""-^^ 

(5b) J^St^"^ 
i„>o„ . dD,es ** and a nd solve the resultant mixed 

ab 2 x T> r p T am l MILP) K< *»™s ^ ^ 

«Z . ' r " t0 bC ime8ral Temi * at e with the 
opoma solution to the MILP or with a suboptimal so 
tata. if . .certain predetermined CPU time is £££ 
Denote the solution by (X - /, f S„ 5). exceede d- 
(2) (SufrproWms) Fix the variables X/ V' an H . 
their levels found at Step 1. Solve the result " 
shift scheduling subprobLs (m^el (^ = § 1 

' and , ,F STOP°S n fi C f *, iX ' d " *« } - iS im ^ al for *» 
fM Pi cr ' finaI S0 ' Utl0n for P r °Wem (4); 

J.H f T° ke tHe h6Uristic for a » feasible shift 
scheduling subproblems (6). 

The decomposition algorithm fixes the aggregate and 

Ts err z: bles :: ^-^--SSi 

m btep 1. Because the number of inteeer ™ri a ,,i 
relatives small, a depth first search 
with an mteger, although not necessaru/op^o 
ubon to this relaxed problem. Upon completion the 
corresponding objective function value is & + 2 ^ 
x where represents the cost for a part 

at:., ut ,6n f " ' ° n ^ d ' St ^ 2 *«-p" oCd 
ar , al -integer solution for the given values of the ag 

gregate and workforce variables. This leads to the 3- 
lowmg straightforu-ard observation: 

PROPOSmoN 2. If an optimal solution is found for the 
relaxed problem in Step 1 of the algorithm, and if S ep 2 

•vlSTf Wh !? USin§ impliCit aeration, the 
a- ailabihty of good lower bounds is essential for speed- 
ing convergence. An obvious lower bound can be ob- 
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tamed from the LP relaxation of (4). A tighter bound 
re qumngjittle additional work mav b^H^ 
lows, define X j and Y to belh^Hyl^^ 
of full-time and part-time shifts, and soIvp ^ i^^ 
of jnejpers onnel scheduling P robl e m747wl^r7^ 
g^vgg g^long with u are req uired to be ir^T 
Of course, if the branch and Bo7ndlr7s?ep 1 oflhe 

algonthm achieves optimality, then a lower bound to 

(4) is also obtained. 

If one or more subproblems are infeasible at the end 

of Step 2, control passes to the heuristic which attempts 

to achieve feasibility by sequentially adding shifts In 

order to describe its operations, define 

SLACK = 40 | 2 ?' m 
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J>> the slack in equation (4f). This value is used tocheck 
to see it more part-time shifts can be added without 
violating the labor ratio restriction. 

Feasibility Heuristic 

1 (Add Part-Time Shifts) 

(St ° re current so,ut 'on) Set yj, = yj, for all 

lb) Smarting with smallest / E L, cycle through the 
terms Y d , in ascending order. 

lc) For current /,put Y p dl «h- y ' + ] 
IF for shift length /, SLACK < 0, GOTO Step 2. 
ELSE solve the new shift scheduling problem (6). 
IF (x d ft.) is integral, STOP with the final so- 
lu«on(X;.yi. «.*,,, 

else put y^-y 

IF / is the longest part-time shift length, set / to the 
shortest part-time shift length and 
GOTO Step Id. 
ELSE set / to the next longer part-time shift length 
in the cycle and GOTO Step lc. 

Id) (Add permanent shift and start again) Put 
Y m *- Y m + 1, update SLACK and GOTO Step lb. 
2. (Add New Tour) 

2a) Put*-* + l,andsetf5, = Y ? 5 / foraIl/eL 
Update SLACK accordingly. 

2b) (Add four additional full-time shifts) For each 
day i # d of the week, identify the set of part-time shifts 
with the maximum total number of hours that can be 
reduced due to the addition of an extra full-time shift 
From among the six possible days, choose the four with 
the sets containing the highest number of such hours, 
breaking ties arbitrarily. Update y 5, for all / £ L for the 
chosen four days; update SLACK. 

2c) Solve the new shift scheduling problem (6). 

,v/ * f) iS integral ST0P with final solution 

(Xd> Y m , w, x d ., y J; ); 

ELSE, GOTO Step lb. 
The heuristic simply loosens those shift scheduling 
subproblems that failed to reach all-integer solutions in 
the mam algorithm by increasing the total available 
number of shifts one at a time. The problem is compli- 
cated by the requirement of having to maintain the 
min.mum full-time to part-time labor ratio. Step 2 is 
invoked only when any increase in part-time hours 
would violate the minimum labor ratio. In this case, the 
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number of full-time employees. £. is increased by one 
and the total number of part-time shift types are reset 
to their original values. Moreover, part-time shifts are 
eliminated whenever the newly added full-time shifts 
can replace them. 

The final point to be made in this section is that when 
enforcing the same start times for the full-time workers, 
the main algorithm needs to be modified to accom- 
modate the additional constraints introduced in §3.5. 
Fixing the variables u?\ . . . , w 1 ' ties the seven shift 
scheduling problems together precluding the use of the 
decomposition scheme (i.e., the problem no longer de- 
composes into seven independent subproblems). Sim- 
ilarly, the heuristic has to be applied to the whole prob- 
lem at once. 

5. Application— Staffing at General 
Mail Facilities 

In most major population centers, mail collected locally 
and mail arriving from outside the area is centrally pro- 
cessed at a general mail facility (GMF). Each of the 
approximately 280 GMFs in the U.S. is configured with 
a host of mechanized and automated equipment de- 
signed to sort, sequence, and distribute the various 
classes of mail. Processing operations on each machine 
span up to 2 0 hours per day, se ven days a week, with 
much of the remaining time put aside for preventive 
maintenance. 

The models and algorithms presented in this paper 
have been developed with the GMF personnel sched- 
uling problem in mind. The equipment used for pro- 
cessing can be classified into four groups based on the 
required skill level and operator training involved. The 
four groups are: 

— Group 1: Bar Code Sorters ( BCS ) , Multi-Line Optical 
Character Readers (MLOCR), Remote Bar Code Sorters 
(RBCS), and Sequencing Bar Code Sorters (SEQ.BCS). 
— Group 2: Remote Video Encoding Systems (RVES) 
and Multi-Position Letter Sorting Machines (MPLSM). 
— Group 3: Advanced Face Canceller Systems (AFCS) 
and Bundle Sorters (BUNDS). 
— Group 4: Manual operations. 

Because of union agreements, the personnel sched- 
uling problem naturally decomposes into four indepen- 
dent problems by labor category 7 . In each category, we 



are interested in specifying both the daily shifts and the 
weekly days-oft tor each employee. Daily staffing needs 
do not span the usual 8 to 5 pattern; rather, personnel 
may be required to run the various machines throughout 
most of the day and evening in order to get the mail 
processed and ready for deliver}' the following morning. 
Both full-time regular (FTR) employees and part-time 
flexible (PTF) employees must be considered since the 
current labor contract at the U.S. Postal Service ( USPS) 
allows up to 10% of the workforce to be part-time. 

Flex ibility exists in scheduling FTRs with respect to 
their start-rim es andj n the p ositiorung of their lunch 
breaks; the rule being that som ewhere between the 
fourth and th e six th working hour a half-hour break 
musTbe provided . In assigning shifts, it is desirable that 
the same shift, or at least one with approximately the 
same start time (within ±1 hour) be assigned each of 
the working days. Regarding days-off scheduling, each 
FTR receives two, preferably consecutive, days off each 
week. A goal is to have at least one of these two days 
fall on a weekend. 

A PTF can be scheduled at any time during the week 
for up to a total of 39 working hours. Once called, 
though, he or she cannot be assigned to less than four 
hours a day, and if called for six or more hours must 
be granted a half -hour lunch break somewhere between 
the fourth and sixth working hour. Consequently, part- 
time scheduling involves the juggling of a large number 
of shifts. 

5.1. Input Data 

The Providence GMF served as the testbed for the anal- 
ysis. Input data consisted of the number of employees 
needed during each half-hour period for each of the 
four machine groups. These values were derived from 
a series of equipment selection and machine scheduling 
models developed in companion works (see Jarrah et 
al. 1992a, 1992b) and are available from the authors. 
The labor cost for FTRs is assumed to be $12 /hr for 
machine groups 1 and 3, and $14 /hr for groups 2 and 
4. The labor costs for PTFs are $9/hr for groups 1 and 
3, and $12 /hr for groups 2 and 4. In the baseline sce- 
nario the FTRs work 5-day weeks, 8^ hours per day, 
and are entitled to a half-hour lunch break. 

Five shift lengths are permitted for PTFs: 4, 5, 6-J , 7\ 
and hours; only the last three are eligible for a half- 
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start hmes for the FTRs is investigated. 

5.2. Computational Results 

Twenty-eight runs were made to test the algorithms 
Table 1 g 1 ves the problem size and input parameters 



Table 1 Problem Characteristics 



for each of the runs. The four blocks of data are for the 

^^p^2f MU)CR ' RBCS - SEQ - BCS (runs ^ 

?n 1 w M (mnS 8 ~ 14) ' AFCS-BUNDS (runs 15- 
2 1) and MANUAL (runs 22-28). As can be seen, he 

*; Var >' ,n size ^ a factor of two with run 8 
tamg the smallest in both number of variables and con- 

Ln SV? ^ lar§eSt " nUmber °' Variabl «< -d 
run 28 the largest m number of constraints. The average 

subproblem size shows some variability as well, wifh 




Group 2: 
8 
9 
10 
11 
12 
13 
14 

Group 3: 
15 
16 
17 
18 
19 
20 
21 

Group 4: 
22 
23 
24 
25 
26 
27 
28 



806 
806 
806 
806 
806 
863 
819 



908 
908 
908 
908 
908 
988 
926 



1.457 
1.457 
1.457 
1.457 
1.457 
1.607 
1.483 



198/99 
198/99 
198/99 
198/99 
198/99 
218/100 



90-10 rule. 5-day 
85-15 rule. 5-day 
80-20 rule. 5-day 
75-25 rule. 5-day 
70-30 rule. 5-day 
90-10 rule. 5 & 4 
90-10 rule. 5-day 



weeks, flexible start-times 
weeks, flexible start-times 
weeks, flexible start-times 
weeks, flexible start-times 
weeks, flexible start-times 
day weeks, flexible start-times 
weeks, same start-times 



471 
471 
471 
471 
471 
487 
562 



498 
498 
498 
498 
498 
512 
614 



750 
750 
750 
750 
750 
766 
947 



121/66 
121/66 
121/66 
121/66 
121/66 
135/67 



157/82 
157/82 
157/82 
157/82 
171/83 



214/106 
214/106 
214/106 
214/106 
214/106 
236/107 



90-10 rule. 
85-15 rule. 
80-20 rule, 
75-25 rule, 
70-30 rule. 
90-10 rule. 
90-10 rule, 



5-day weeks, flexible start-times 
5-day weeks, flexible start-times 
Way weeks, flexible start-times 
5-day weeks, flexible start-times 
5-day weeks, flexible start-times 
5 & 4-day weeks, flexible start-times 
5-day weeks, same start-times 



90-10 rule, 
85-15 rule. 
80-20 rule. 
75-25 rule. 
70-30 rule 
90-10 rule, 
90-10 rule. 



5-day weeks, flexible start-times 
Way weeks, flexible start-times 
5-day weeks, flexible start-times 
5-day weeks, flexible start-times 
5-day weeks, flexible start-times 
5 & 4-day weeks, flexible start-times 
5-day weeks, same start-times 



90-10 rule, 
85-15 rule, 
80-20 rule. 
75-25 rule. 
70-30 rule, 
90-10 rule, 
90-10 rule. 



5-day weeks, flexible start-times 
5-day weeks, flexible start-times 
5-day weeks, flexible start-times 
5-day weeks, flexible start-times 
5-day weeks, flexible start-times 
5 & 4-day weeks, flexible start-times 
5-day weeks, same start-times 
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the smaller problems appearing in the RVES-MPLSM 
group and the larger ones in the MANUAL group. Recall 
that in runs 7, 14, 21 and 28, the problems are not 
decomposable, and hence, the entries for average sub- 
problem size are left blank. The variability in problem 
size across groups is due to the different number of 
scheduling periods for each of them, while the vari- 
ability within the groups is due to changes in labor rules. 

All algorithms were written in FORTRAN and run 
on the IBM 308 1-D at the University of Texas. OSL's 
simplex based codes were used as solvers. The MPS 
input files for OSL were generated with the algebraic 
modeling language XML (Marsten 1981 ). XML in turn 
requires LINDO-like input files which were created us- 
ing a uniquely tailored FORTRAN code. Storage re- 
quirements for the MPS files ranged from 1 to 2.5 
megabytes. 

The branch-and-bound portions of the algorithms use 
OSL's default subroutine; however, initial branching is 
stipulated to follow the order in which the variables 
appear in Step 1 (master problem) of the main algo- 
rithm, and a depth-first search is enforced until a feasible 
solution is found. Subsequently, OSL is left to follow 
its own branching and node selection rules. This scheme 
proved instrumental in locating a good feasible solution 
early on. The branch-and-bound is stopped if the op- 
timal solution to the master problem is not reached 
within 300 CPU seconds after the LP relaxation is 
solved, except in runs 7, 14, 2 1 and 28 where the cutoff 
time is increased to 500 CPU seconds due to the larger 
number of integer variables involved. Finally, Postpro- 
cessor II (assigning shifts to tours) is stopped whenever 
the number of iterations equals the number of employ- 
ees being scheduled, or if 12 iterations elapse without 
any reduction in the variance of the start times. 

Tables 2 and 3 list the solutions and CPU times for 
the 28 runs, respectively. Column (2) displays the lower 
bounds obtained in each case. These bounds were de- 
rived in one of two ways; (i) from the LP relaxation of 
problem (4); or (ii) from the relaxation involving the 
weekly aggregate variables as described in §4.2 (a 300 
CPU sec limit was placed on these computations as 
well). Column (3) gives the value of the best feasible 
solution uncovered. The percent gap between the upper 
and lower bounds is indicated in column (4), and is 
seen to be consistently less than 1% except for runs 20, 



27 and 28. In particular, the gap was zero ( to the second 
decimal place) in 14 instances and less than 0.2% in 
nine others. 

Columns (5) through (12) report the number of 5- 
day FTRs, 4-day FTRs, 4-hour PTF shifts, 5-hour PTF 
shifts, 6 |-hour PTF shifts, 1\ -hour PTF shifts, and 8^ - 
hour PTF shifts, respectively. As expected, the shorter 
PTF shifts are used more often than the longer ones. 
Column (13) lists the slack, or difference between as- 
signed labor hours and required labor hours; the vari- 
ability in this entry will be discussed shortly. 

In Table 3, columns (14) and (15) give the solution 
times for the LP relaxation and lower bound results, 
respectively. Column (16) gives the CPU time for the 
branch-and-bound (BB1 ) computations in Step 1 of the 
main algorithm. Note that in 2 cases the optimal solution 
was found within the 300 or 500 CPU seconds allotted 
for this phase. This, coupled with the 12 other cases 
where the gap was zero, is strong testimony to the ef- 
fectiveness of the aggregate variable cuts (eqs. 5a and 
5b). Column (17) reports the number of nodes evalu- 
ated during branch-and-bound in Step 1, and column 
(18) gives the node at which the best solution was 
found. 

Column (19) reports CPU times for the branch-and- 
bound (BB2) in Step 2 of the main algorithm. Recall 
that we are attempting to find feasible solutions to the 
subproblems at this stage. Note that for runs 13 and 
21, these computations were not needed; i.e., fixing the 
a gS re g ate a ^d workforce variables at integer values 
yielded a solution in which all the shift variables (x, y) 
were also integral. This was not the case for the other 
runs; column (20) gives the number of subproblems 
that had to be solved in each instance. This number 
was kept to a minimum by recognizing davs with iden- 
tical staffing requirements and aggregate variable values 
(obtained in Step 1 of the algorithm), and solving only 
one subproblem for such days. Problems 7, 14, 21 and 
28 do not decompose into independent shift scheduling 
problems as discussed earlier, and hence, the branch- 
and-bound in Step 2 was applied to the whole problem. 
A corresponding increase in computational burden was 
experienced. 

It is rather remarkable that in all the runs, all-integer 
solutions were obtained at relatively low computational 
cost. Hence, the heuristic (column 21) was never in- 
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Table 2 Results for the Four Machine Groups— Cost and Workforce 
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4 
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voked. These findings are consistent with those of Mor- 
ris and Showalter (1983) for single-day, single-shift 
problems. 

Columns (22) and (23) in Table 3 report the CPU 
times for the two postprocessors. While the first of these 
solved quickly, some effort was needed for the second, 
except for problems 7, 14, 21 and 28, where all tours 
are explicitly enforced to start at the same time each 
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working day. The disproportionate amount of time 
spent on assigning shifts to tours is due to the large 
number of swaps that were attempted to improve the 
quality of the final schedules. In the worst case, the 
running time of exchange heuristics are often exponen- 
tially bounded. We will return to this point presently. 
Finally, column (24) gives the total CPU seconds for 
each of the problems. Solution times range between 
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t Alt computations performed on an IBM 3081 -D. 



27.4 CPU seconds (run 10) and 20.4 CPU minutes 
(run 28). 

Four- and Five-Day Work Weeks 

For each of the machine groups, a substantial cost 
reduction is achieved when 4-day weeks are allowed 
(5.8%, 17.6%, 4.9%, and 10.5%,' respectively). To il- 
lustrate, for group 1, a combination of fifty-eight 5-day- 



a-week and thirty-eight 4-day-a-week FTRs along with 
sixteen 4-hour, two 5-hour, and fifty-eight 6-hour PTF 
shifts obtained in run 6 were able to match the staffing 
requirements more closely than the 102 5-day-a-week 
FTRs with eighty-three 4-hour and seven 5-hour shifts 
obtained in run 1. This is evident in the reduction of 
slack hours from 411 to 146. It can be verified that this 
simple modification translates into annual savings in 
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excels of 1.3 million dollars lor the whole facility (i K - 
^ . nonng the time value of money within the year): 
Enforcing Same Start Times 
When each FTR is required to start his or her shift at 

cos s above the basehne were 0.0%, 1.3%, 0 .1%, and 

no telZ •T 1 *' reSPeCHvdy - " * Curio - »o 

note that for group 1, the cost actually went down by 

$9 from run 1 to run 7. This can be explained by ex 
anunmg columns ( 8 ) and ( 9) in Table 2. As can be 12 
one agonal part-time hour is contained in thHo 
luhon for run 1, implying suboptimality 

Somewhat surprisingly, the results for the baseline 
runs and th,s scenario were quite similar. A partia ex 
planar owes to the existence of extensive slack in 
both cases (column 13). The requirement tha wo k 
ass.gnments be continuous during the day precluded 
the posstbthty of redistributing the slack and eLnating 
°"^ n y™« 0n ' or management is that it may be 
des.rable to .mplement a same start time policy and 
incur the extra cost (a total of $3,381 per week) 
change for a likely improvement in employee mora, 
Most workers prefer the same shift every day. 
Changing the Full-Time to Part-Time Labor Ratio 

of Si USPS lab ° r C ° ntraCt that the "tio 

of full-time to part-time hours be no less than 0 9 The 

queshon arises as to the benefits that can be gained by 
v^TAhe 15 n r ber Fr ° m m -^-nt's 8 point o 

th^'i nil 6XpeCted to be ^--al, 

hen ,t nught not be worth confronting the union on 

™ueh he POtential SaVingS W ^nificant ho:" 
ever, a tough negonatmg stance may be justified. 

« m ? e T? C J 3ngin8 thC mJnimum to part- 

hme raho from 90-10 down to 70-30 in increments of 

19°and 22 Tv" * ^ ° f U5 ' 8 " 12 ' 

that a U h eXamplC ' f ° r W 1 if is °^rved 
that a large reducnon in cost occurs at each 5% incre- 
ment wtth a total of $5,670 per week (or 10.7%) be- 
tween the 90-10 and 70-30 rules. The same result are 

[ ,f : rtheo r maChine ^ U P s correspond 
ngn.duct.on ,n slack with each 5% increment fn the 
abor rule reveals that the saving, stom no, only from 
thelower cost of part-time workers, but also from the 

"nXt ,tyt0mat I Ch ; ab -^^mentswithsh^ 
lengths. The larger pool of four and five hour part-time 
sh;fis makes this possible. 
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F-gure 3 depicts the parametric relationship between 
aggregate cost and the labor ratio for the entire facility 
Notice that the incremental savings per 5% change in 
abor rule is less for the 80-20 to 70 30 interval than 
or the 90-10 to 80-20 interval. The reason is that slack 
eduction tends to become more difficult after a 
amount of part-time shifts are made available. Nev 

lo^l ' 1 6 findin8S P ° im "P the desirab ^»y of ne- 
the 7n n fn , n8eS ^ ^ lab ° r rati ° Since switc "ing to 

Assessment of Postprocessor II 

It is important to investigate the quality of the derived 
schedules with respect to the variation in the daily start 
times for the FTRs. Table 4 gives the number oS 
or each of the 28 test problems, along with statistic 

P D Ld n Tn tOUrS - bef ° re aftef lessor II * 
apphed. These statistics consist of the percent of tours 
havmg the* start times within a two hour band the 
worst (wriest) band for any of the tours, and the pe c n k 
of employees assigned weekends off or two consecutive 
days off (other than weekends). 

As can be seen, in quite a few cases the postprocessor 
js successful in narrowing the spread in employed 
nmes however, this sometimes occurs at tie expense 

days off. To dlustrate, for run 1, while the widest start 
hme vanation was reduced from 11 to 3.5 hours, the 
percentages of weekends and consecutive days off 
dropped from 21.6% and 44.1% to 19.6% and 32 3% 
respectively. In those instances where some tours have 
unreasonable variation in start times (e.g., some of the 

RQore 3 ABect of Changing FTR to PTF Labor Ratio 

Weekly Staffing Cost 

Percent Saving 




90- 10 



85 15 80 - 20 75 - 25 

FTR/PTF Labor Ratio 



70-3C 
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tours associated with runs 26 and 27) more part-time 
or tull-time shifts are necessary to reduce the spread. 

6. Summary and Conclusions 

Labor costs still dominate many organizations. In an 
increasingly competitive environment, good staff 
schedules can make the difference between mere sur- 
vival and a profitable operation. The personnel sched- 
uling problems solved in this paper are the largest and 



most complex addressed in the literature to date and 
are representative of those being solved continually in 
a wide range of industries. Near-optimal solutions were 
obtained in reasonable CPU times for problems with 
up to 1,600 integer variables. 

From a methodological point of view, an interesting 
extension to this work would be to allow different days- 
off patterns for the full-time employees. This can be 
done by appropriately modifying constraints (4g) and 
(4h) (as was done in §3.5 to accommodate 4-day work 
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weeks), and /or by introducing additional constraints 
on a? to reflect the new days-off restrictions (see Burns 
and Carter 1985). Another extension would be to de- 
velop a technique for enforcing bands on the start times 
of full-time employees. It is easy to enforce same start 
times but not bands. We now rely on a postprocessor 
to assign shifts to tours; however, the accompanying 
heuristic offers no assurance that each shift will start 
within a given band. Finall y, the biggest challenge for 
future research is e xtending the methodology to solv e 
t he continuous tour scheduling problem. 

Regarding the main algorithm, we have not yet been 
able to fully explain its efficiency. The feasibility sub- 
problem (6) at Step 2, containing on average over 1000 
integer variables, always terminated with an integer so- 
lution within a minute, and sometimes within seconds 
except for those runs where all seven subproblems have 
to be solved simultaneously. But even there, the solution 
times for BB2 were only a few minutes, and for run 21 
the subproblems did not have to be solved at all. One 
conjecture for explaining these results is that the sur- 
rogate cuts (5) define a facet of the original problem's 
convex hull, thereby increasing the likelihood that a 
solution to the LP relaxation is integral. More work 
needs to be done, though, to substantiate this. 1 

1 A portion of this work was supported by a grant from the Texas 
Higher Education Coordinating Board's Advanced Research and 
Technology Programs. 
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1. Introduction 

A call center (also referred to as a contact center) is 
an important component of the operations of many 
organizations. A significant portion of the cost of 
operating a call center is staffing cost (Gans et al. 
2003). In this paper, we describe a method for finding 
staff- (agent-) level requirements, while simultane- 
ously selecting staff shifts that cover these require- 
ments, that minimize cost while ensuring satisfactory 
customer service. An analytic center cutting-plane 
method guides the selection of staffing levels, and 
simulation is used to report service-level performance 
for each set of staffing levels. 

This problem has received a great deal of attention 
in the literature, and therefore one can reasonably ask 
why there is a need for a computational tool of this 
sort. To answer that question, we first need to describe 
the overall staffing process. There are variations on 
the following theme (e.g., Castillo et al. 2007), but the 
essential structure is sequential in nature and is as 
follows (Mason et al. 1998). 

1. Forecasting: Obtain forecasts of customer load 
over the planning horizon, which is typically one or 
two weeks long. The horizon is usually broken into 
short periods that are typically between 15 minutes 
and 1 hour long. 



2. Work requirements: Determine the minimum 
number of agents needed during each period to 
ensure satisfactory customer service. Service is typ- 
ically measured in terms of customer waiting times 
and /or abandonment rates in the queue. 

3. Shift construction: Select staff shifts that cover the 
requirements. This problem is usually solved through 
a set-covering integer program; see Mason et al. (1998) 
for more details. 

4. Rostering: Allocate employees to the shifts. 

The focus in this paper is on Steps 2 and 3. Steps 1 
and 4 are not considered further. 

Step 2 is usually accomplished through the use of 
analytical results for simple queueing models. Green 
et al. (2001) coined the term SIPP (stationary, inde- 
pendent, period by period) to describe the general 
approach. In the SIPP approach, each period of the 
day is considered independently of other periods, 
the arrival process is considered to be stationary in 
that period, and one approximates performance in the 
period by a steady-state performance measure that is 
usually easily obtained from analytical results for par- 
ticular queueing models. Heuristics are used to select 
input parameters for each period that yield a close 
match between the predictions and reality. For a wide 
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variety of queueing models, this procedure results in 
some form of the "square-root rule/' which is a rule of 
thumb that provides surprisingly successful staffing 
level suggestions. See, e.g., Borst et al. (2004), Kolesar 
and Green (1998), and Jennings et al. (1996) for more 
on the square-root rule. 

The SIPP approach is appealing from the stand- 
point of computational tractability and because of the 
insights it provides. However, there are cases where 
the SIPP approach does not do as well as one might 
hope (Green et al. 2001, 2003). For example, this can 
occur when the service times are relatively long; see 
Whitt (1991) for more on this point. Moreover, call 
centers can be somewhat complex in structure, and 
this complexity can make it difficult to identify a 
queueing model of the center that is both mathemati- 
cally tractable and a reasonable match to the true sys- 
tem. In such cases, simulation is a viable alternative. 
Indeed, simulation is now increasingly used in Step 2; 
see §VIII of Mandelbaum (2003) for many examples. 

Further motivation for the use of simulation in- 
volves the linkage between staffing decisions in adja- 
cent periods. Boosting staffing levels in one period 
can often help in reducing workload in subsequent 
periods, so there can be linkage in performance 
between different periods. Such linkage can imply 
that there are multiple solutions to the work require- 
ments problem that can offer valuable flexibility in 
Step 3. Traditional queueing approaches are not sat- 
isfactory in such cases, and then one turns to sim- 
ulation or other numerical methods. Indeed, Green 
et al. (2001, 2003) solve a system of ordinary differ- 
ential equations through numerical integration to get 
the "exact' 7 results for their models to compare the 
performance of various heuristics. 

Assuming that one uses simulation or some other 
numerical method to predict performance in the call 
center, one then needs to devise a method to guide the 
selection of potential staffing levels to be evaluated. 
There have been several suggestions in the literature, 
all of which explicitly capture the linkage between 
periods in an attempt to realize cost savings. Like 
Green et al. (2001, 2003), Ingolfsson et al. (2002) use 
numerical integration to compute service-level per- 
formance for a proposed set of staffing levels, and 
a genetic algorithm to guide the search. Ingolfsson 
et al. (2007) again use a numerical method to com- 
pute service-level performance, and integer program- 
ming to guide the search. Castillo et al. (2007) devise 
a method for randomly generating sets of staff shifts, 
then use simulation to evaluate the service-level per- 
formance of each set of generated staff shifts, and 
finally, plot the cost versus service level of the poten- 
tial solutions to identify an efficient frontier. Atlason 
et al. (2004) use simulation to evaluate service level 



performance of a proposed set of shifts, and use inte- 
ger programming in conjunction with Kelley's cutting 
plane method (Kelley 1960) to guide the search. 

In Atlason et al. (2004), performance in each time 
period is measured as the fraction of calls responded 
to on time, and analysis relies on the assumption that 
service in a period is concave and componentwise 
increasing as a function of the staffing-level vector. 
To understand this assumption, consider a single- 
period problem. Increasing the staffing-level should 
lead to improved performance, i.e., an increasing frac- 
tion of calls responded to on time. Furthermore, one 
might expect "diminishing returns" as the staffing 
level increases, so that performance would be con- 
cave in staffing level. Empirical results suggest that 
this intuition is correct, at least for sufficiently high 
staffing levels. But for low staffing levels, the empiri- 
cal results suggest that performance is increasing and 
convex in the staffing level. Therefore, performance 
appears to follow an "S-shaped" curve (Ingolfsson 
et al. 2007, Atlason et al. 2004) in one dimension. 

This nonconcavity can cause the cutting-plane 
method of Atlason et al. (2004) to cut off feasible solu- 
tions, and the problem can be so severe as to lead 
to the algorithm suggesting impractical staffing plans. 
Nevertheless, the ability of the Atlason et al. (2004) 
approach to efficiently sift through the combinatorially 
huge number of potential staffing plans is appealing. 
One might ask whether there is a similar optimization 
approach that can satisfactorily deal with S-shaped 
curves and their multidimensional extensions. That 
is the subject of this paper. We replace the assump- 
tion of concavity with a weaker one (namely, pseudo- 
concavity), and use a different cutting-plane method 
that, together with additional techniques, successfully 
handles multidimensional extensions of the S-shaped 
curves alluded to above and seen in examples like that 
depicted in Figure 2; see §4.1 for more details. 

Assuming that the service-level functions are pseu- 
doconcave, an analytic center cutting-plane algorithm 
can be used to efficiently search the combinatorially 
large space of potential staffing plans. In essence, 
the algorithm works as follows. One begins with a 
polyhedron that contains an optimal solution to the 
staffing problem. At each stage, the algorithm selects 
a staffing plan that lies close to the analytic center of 
the polyhedron and runs a simulation at that point 
to determine service-level performance. Depending 
on the results of the simulation, an "optimality cut" 
or one or more feasibility cuts are added, thereby 
shrinking the polyhedron. The algorithm terminates 
when the polyhedron contains no integer points, or 
when the difference between upper and lower bounds 
on the optimal objective is sufficiently small. 
We view the contributions of this paper as follows. 
1. Under realistic assumptions on the service- 
level functions, we give an algorithm for solving to 
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optimality the combined Stop 2-3 problem. The com- 
bined procedure explicitly addresses linkage between 
periods and the impact of shift-based scheduling. 

2. We give conditions under which the algorithm 
provably converges. 

3. We compare the proposed algorithm to what 
can be reasonably viewed as the current best practice 
to better understand the properties of the algorithm. 
This comparison must take place with a model of lim- 
ited complexity so that the existing methods can be 
applied, but is nevertheless quite illuminating. 

The numerical results in §5 show that the ana- 
lytic center cutting-plane method outlined here out- 
performs, or at least equals, the SIPP heuristics in 
every case in which shift structure of staff schedul- 
ing is explicitly considered, which is the setting we 
are interested in. In that sense, it is a robust proce- 
dure that can be applied in a near black-box fashion. 
Of course, these extremely appealing properties have 
to be traded off against the computational cost of the 
procedure, which is not insignificant. We are actively 
considering methods for reducing the computational 
effort of the procedure; see §6. 

The remainder of this paper is organized as follows. 
We formulate the call center staffing problem in §2. 
In §3, we review cutting-plane methods for contin- 
uous problems with pseudoconcave constraints. Sec- 
tion 4 describes modifications for discrete problems, 
such as the call center staffing problem, and proves 
that the algorithm converges. We compare the pro- 
posed algorithm with the SIPP methods described in 
Green et al. (2001, 2003) in §5. Section 6 concludes the 
paper and discusses future research questions. 

2. Problem Formulation 

In this section, we formulate the call center staffing 
problem of minimizing staffing cost over a planning 
horizon while maintaining an acceptable level of ser- 
vice in each of a number of time periods, with par- 
ticular emphasis on the use of simulation to estimate 
the service levels. We say that service is acceptable 
in a period if the long-run percentage of calls that 
arrive in this period and are answered within a cer- 
tain time limit r meets or exceeds a threshold v. This 
is the usual "tt percent of calls are answered within 
t seconds" requirement that is something of an indus- 
try standard, cf. the "80/20 rule" for which tt is 80% 
and t is 20 seconds. 

Define the staffing-level vector as y = (y lf . . . , y p ) T , 
where p is the number of time periods in the plan- 
ning horizon, y { is the number of agents working in 
period /, i = 1, . . . , p, and ( ) T denotes the transpose 
operation. Let be the random number of calls that 
are received in period /, and let S,(y) be the random 
number of those calls that are satisfactorily handled, 



i = We can express the service-level con- 

straint ElSityfl/ElNi] > tt as 

g i ( } ,) = ES i (y)~TTEN i >(), ,' = 1 p. 

Here, £,(y) gives the expected number of success- 
ful services over and above the required expected 
number in period i. (Note that requiring that 
E[ 5 , (y)/N»] > is not a valid way to enforce the con- 
straint because it gives more weight to service on 
days where the call volume is low. Also note that 
we include a separate service-level constraint for each 
period to ensure uniform quality of service through- 
out the planning horizon, which is consistent with 
industry practices.) 

For a staffing-level vector y, there is an associated 
cost of covering the staffing levels with tours. The 
term "tour" refers to a collection of periods that a 
single employee works over the planning horizon. 
A tour must agree with all provisions of employee 
contracts, such as rules on meal breaks and a limit 
on the number of hours an agent can work over the 
planning horizon. All feasible tours are enumerated 
prior to problem formulation and stored in the tour 
matrix A such that A* = 1 if tour ; includes period i, 
and zero otherwise. Also, the ;th component of the 
cost vector c is the cost of the ;'th tour. We let x be 
a vector with jth component Xj giving the number of 
employees working the ;th tour. We assume that there 
are m feasible tours, therefore x e Using this nota- 
tion, the minimal cost of covering work requirement 
vector y is 

/(y) = min c T x 

s.t. Ax>y, 

x>0 and integer. (1) 

We assume that every period is covered by at least 
one tour, i.e., for every / there is at least one ; such that 
A tj = 1. It then follows that (1) is feasible for any y. 

The call center staffing problem can now be formu- 
lated as 

min /(y) 

s.t. &(y)>0 for i = l,...,p, 

y > 0 and integer. (2) 

Recall that we cannot compute the (vector-valued) 
service-level function g(y) exactly, and instead use 
simulation. The call center staffing problem is then 
a simulation optimization problem. As in Atlason 
et al. (2004), we adopt "sample-average approxima- 
tion" (see Shapiro 2003, Kleywegt et al. 2001) for solv- 
ing such problems. This approach, specialized to our 
setting, is as follows. One first generates the simu- 
lation input data sets for n independent replications 
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of the operations of the call center over the planning 
horizon. Each data set includes call arrival times and 
service times, and other input data needed to simu- 
late operations of the call center at hand, if appro- 
priate, e.g., call abandonment times and so forth. The 
simulation data sets are fixed, leading to a determin- 
istic optimization problem that chooses staffing levels 
so as to minimize staffing cost, while ensuring that 
average service computed only over the generated realiza- 
tions is satisfactory. We can then attempt to identify or 
develop deterministic optimization methods to solve 
the resulting problem. 

Suppose that service-level functions £,(y), /' = 
1, . . . , p, are estimated by corresponding sample aver- 
ages gidfi n), where n is the sample size used. The 
sample average approximation of the call center 
staffing problem is then 

min /(y) 

s-t. |i(y;n)>0 for i = l v , 

y > 0 and integer. (3) 

Proposition 1 below gives conditions under which 
solutions to (3) converge to those of the true prob- 
lem (2) as n — > oo. We first make the following natural 
assumption about the cost vector. 

Assumption 1. The cost vector c is positive and inte- 
ger valued} 

Assumption 1 implies that f(y) is integer valued. 
Moreover, because c>0, the z-level set of /, 

{y>0 and integer: 3x>0 and integer, Ax >y,c r x< z], 

is finite for any zeR. 

Let Y* denote the set of optimal solutions to the 
true problem (2). 

Proposition 1. Suppose that Assumption 1 holds. 
Suppose further that for each y, gj(y;n) g t (y) as 
n oo with probability one. Finally, suppose that there is 
an optimal solution y* € Y* that is "strictly feasible/' Le., 
&(y*) > 0 for all i = 1, . . . , p. If y* is an optimal solution 
to (3) for each n, then y* n € V* for n sufficiently large, with 
probability one. 

This result is easily proved using techniques very 
similar to those used in Atlason et al. (2004), and 
therefore we omit the proof. The requirement of a 

1 The integrality part of Assumption 1 can be satisfied in practice 
by multiplying by the appropriate power of 10 if necessary, e.g., 
by 100 if the cost is in dollars and cents. This does not change the 
computational complexity of the problem because the number of 
significant digits to represent the data remains the same. In this 
case, attention should be paid to the choice of the weight param- 
eter w defined in §3 to get a significant reduction in the objective 
value between the associated optimaiity cuts. 



"strictly feasible" optimal solution is easily justified in 
practice: it is almost surely impossible for the service- 
level function g { to be exactly zero when y takes on 
only discrete values. 

Proposition 1 establishes the validity of the sample 
average approximation approach in our setting. Much 
more can be said about the convergence of solutions 
of the sample average approximation problem and 
their objective values, but that is not the emphasis of 
this paper. We refer the interested reader to Atlason 
et al. (2004) and Kleywegt et al. (2001). 

3. The Analytic Center 
Cutting-Plane Method 

In this section, we state a version of the traditional 
analytic center cutting-plane method (ACCPM) to fix 
ideas and provide a departure point for a reader 
unfamiliar with cutting-plane methods in continuous 
optimization. 

There are many cutting-plane methods for solv- 
ing convex optimization problems, including what 
may be termed "boundary methods," such as Kelley's 
algorithm (Kelley 1960) and its extensions (e.g., 
Westerlund and Porn 2002), and interior point meth- 
ods, recently reviewed by Mitchell (2003). We will 
focus our attention on one of the latter, namely, the 
ACCPM, which was first implemented in du Merle 
(1995), as pointed out in Elhedhli and Goffin (2003). 
The ACCPM has proven effective in terms of both 
theoretical complexity (Atkinson and Vaidya 1995, 
Nesterov 1995, Goffin et al. 1996, Mitchell 2003) and 
practical performance on a variety of problems (Bahn 
et al. 1995, Mitchell 2000, and other references in 
Mitchell 2003). Software packages implementing the 
method are available (e.g., Peton and Vial 2001). 

Many versions of the ACCPM for convex feasibil- 
ity and optimization problems have been explored in 
the literature. The description we chose below bor- 
rows from Nesterov (1995), du Merle et al. (1998), and 
Mitchell (2003), among others. 

Consider an optimization problem in the following 
general form: 

min b T y 
s.t. y € Y, (4) 

where Y c R" is a convex set, and b € K". (A prob- 
lem of minimizing a general convex function over a 
convex set can be easily represented in this form.) To 
simplify the presentation, assume that the set Y is 
bounded and has a nonempty interior. 

To apply the ACCPM (or any other cutting-plane 
algorithm), the feasible region Y needs to be described 
by a separation oracle. Such an oracle will, given an 
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input y € U", either correctly assert that y € Y, or oth- 
erwise return a nonzero vector q e R", such that 

9 T Cy-y)>« v y€ y, 

i.e., produce a feasibility cut. (Depending on how the 
set Y is described, the oracle might produce a deep 
or shallow cut, which has the same form as the con- 
straint above, but a positive or negative right-hand 
side, respectively.) 

We now describe a typical iteration of the ACCPM. 
At the beginning of an iteration, we have available a 
finite upper bound z on the optimal value of (4), and 
a polyhedron P = {y eR n : Dy > d) that is known to 
contain the set y. Here, D e R rxn and d e R r for some 
finite r. We first compute the (weighted) analytic cen- 
ter of the set P O [y e R n : b T y < 2} (for ease of presen- 
tation, we assume that the set Pfi {y € R n : b T y < z] is 
bounded), defined as the solution of the convex opti- 
mization problem 

min i-wlog(z-b T y)-Vlog(D / .y-^)L 

(5) 

where D t is the /th row of D and w > 0 is a weight 
constant that affects the convergence rate of the algo- 
rithm (see, for example, du Merle et al. 1998). The set 
Pn {y e R n : b T y < z] is often referred to as the localiza- 
tion set because it contains all feasible solutions with 
objective function values lower than 2. 

Finding a solution to (5) with a high degree of preci- 
sion is a relatively simple task from a practical stand- 
point and can be done, e.g., via Newton's method. 
Let y be the analytic center found. Next, the oracle 
is called with y as the input If y € Y, then, by con- 
struction, b T y < z, and the upper bound is lowered 
by taking 2 := b T y. Otherwise, if y & Y, the oracle 
will produce a vector q providing a feasibility cut, 
which is then added to the description of the poly- 
hedron P. The procedure is then repeated. A slightly 
more detailed description of the algorithm is pre- 
sented in Figure 1. 

Intuitively, the algorithm's efficiency stems from the 
fact that at each iteration the cut being added passes 
through the analytic center of the localization set, 
which is often located near a geometric center. Thus, 
the volume of the localization set reduces rapidly with 
each iteration. 

Suppose that the feasible set Y is specified by 
y = {y € U n : gi(y) > 0, / = 1, . . . , p}, where the func- 
tions gi(y), i = 1, . . . , p, are pseudoconcave, as defined 
below. 

Definition 1 (Cf. Definition 3.5.10 in Bazaraa 
ET al. 1993). Let h: S -+ R be differentiate on S, 
where S is a nonempty open set in R n . The function h 
is said to be pseudoconcave if for any y, y € S, with 
vh (9) T (y ~ 9) < 0/ we have h(y) < h(y). Equivalently, 
if h(y) > h(y), then Vh(y) T (y - y) > 0. 



Figure 1 The Analytic Center Cutting-Plane Method (ACCPM) for 
Problem (4) 

Initialization Start with a polyhedron P° := [y € U n : D°i/ > d°\ 
such that Y c P°, a solution y* € Y, and an upper bound 
2 0 := b r y\ Let ztP > 0, and set the iteration counter k := 0. 

Step 1. If termination criterion is satisfied, then stop, and 
return y' as a solution. Otherwise, solve Problem (5) 
with w = w k , z = z k , and P = P k := (y € D* > d k ); 
let y* be the solution. 

Step 2a. If if e Y, then let i=h r y k , y* :=y\ :=D k , 
and d k+l :=d k . 

Step 2b. If y l £ Y, then generate one or more feasibility cuts 
at y*. Update r> and d k to include the new constraints, 
and let D* +1 and d k + l represent the new constraint set. 
Letz* +, :=z*. 

Step 3. Compute w* +x and let Jc:=fc + 1. Go to Step 1. 

Pseudoconcavity of a function is a weaker prop- 
erty than concavity; see also Bazaraa et al (1993, Fig- 
ure 3.12). 

With Y in the above form, the feasibility cut at point 
y £ Y which violates the ith constraint can be speci- 
fied as 

V*,(y) T (y-y)>0 (6) 

because any solution y that satisfies g,(y) > 0 also sat- 
isfies gi (y) > g,(y). 

4. A Cutting-Plane Method for 
Discrete Problems 

In this section, we describe how the ACCPM algo- 
rithm of §3 can be modified to solve the sample 
average approximation (3) of the call center staffing 
problem (2). 

The most significant modification to the ACCPM for 
continuous problems is needed to take into account 
the fact that the feasible region of (3) is no longer a 
convex, or even connected, because of set the inte- 
grality restriction on the variables. Cutting-plane algo- 
rithms for nonlinear mixed-integer programs have 
been explored in the past; see, for example, West- 
erlund and Porn (2002). However, in this and other 
similar papers, it is assumed that the constraint func- 
tions are in fact differentiable functions of continuous 
variables; the integrality restrictions on the variables 
are, in a sense, exogenous. In such a setting, the con- 
cept of a convex (continuous) nonlinear relaxation of 
the integer program is straightforward, and feasibil- 
ity cuts are generated simply using the gradients of 
these continuous functions. In our setting, however, 
the service-level functions and their sample average 
approximations are not defined at noninteger values 
of y, and devising their continuous extension, espe- 
cially one that is easy to work with from the algorith- 
mic point of view, is nontrivial at best. Therefore, we 
take a different approach in adapting the ACCPM to 
the discrete case. 
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As far as we know, the use of ACCPM as a solu- 
tion method for nonlinear integer programs has not 
been reported, although the method has been success- 
fully used to solve the linear relaxation subproblems 
in branching algorithms for integer programs (see, for 
example, Mitchell 2000 and EihedhJi and Goffin 2003, 
among many others). 

In §4.1, we extend the notion of pseudoconcav- 
ity to functions of integer variables, and show how 
feasibility cuts can be generated assuming that the 
functions gj(}/;n), / = are, in fact, discrete 

pseudoconcave. This leads to an ACCPM method for 
(mixed) integer programming problems satisfying the 
pseudoconcavity assumption; the algorithm is appli- 
cable to the types of problems considered in Wester- 
lund and Porn (2002), for example. We also discuss 
whether the S-shaped form of the service-level func- 
tions in the call center staffing problem is consistent 
with this assumption, and propose alternative feasi- 
bility cuts at points where it is violated. 

Section 4.2 discusses other modifications of the 
original algorithm for Problem (3) and details of 
our implementation. Section 4.3 gives a proof of 
convergence. 

4.1. Discrete Pseudoconcave Functions and 
Feasibility Cuts 

We begin by defining the notions of a discrete convex 
set and a discrete pseudoconcave function. We denote 
the convex hull of the set C by conv(C). 

Definition 2. We say that the set C C Z" is a dis- 
crete convex set if C = conv(C) O 7L n , i.e., the set C 
equals the set of integer points in conv(C). 

Definition 3. Let C c Z" be a discrete convex set 
and h: C -* R. Then, h is discrete pseudoconcave on C if 
for any y € C, there exists a vector q(y) € R" such that 
for any y e C, 

?(y) T (y-y)<0 =» h{y)<h(y). (7) 

Equivalently, if h(y) > h(y), then q(y) r {y — y) > 0. We 
call such a vector q(y) a pseudogradient of h at y. 

Discrete pseudoconcavity can be viewed as a relax- 
ation of the concave extensible function property 
defined in Murota (2003, p. 93). Note also the similar- 
ity between the above definition and Definition 1. For 
a differentiable pseudoconcave function, the gradient 
at point y plays the role of a pseudogradient g(y) in 
condition (7). 

If the functions g { (y; n) in (3) are discrete pseudo- 
concave, then a feasibility cut at an integer point y 
that violates the /th constraint can be obtained in the 
form <f(y; nf(y — y) > e, where <f(y; n) is the pseudo- 
gradient of g ( (*; n) at y, and e > 0 is sufficiently small. 

Are the service-level functions we consider indeed 
discrete pseudoconcave? As an illustrative example, 



Figure 2 The Sample Average (Sample Size n = 500) of the Number 
of Calls Answered on Time in Period 2 as a Function ol the 
Staffing Levels in Periods 1 and 2 




Notes. The white dots on the bottom and the black dot on the top represent 
points where a pseudogradient does not exist. The thicker black lines show 
the boundaries of the satisfactory service levels for it equal to 70%. 80%, 
and 90%. 



consider a simple call center with two 30-minute peri- 
ods. The service times are exponential with a mean of 
15 minutes. The arrival process is a nonhomogeneous 
Poisson process with rate varying between 42 and 120 
calls per hour with an average load of 18 in period 1 
and 25 in period 2 (load is the arrival rate divided by 
the service rate). Figure 2 shows the number of calls 
answered on time (here, r = 90 seconds) in period 2 
as a function of the staffing levels in periods 1 and 2. 
(Note that this is equivalent to g with it = 0.) The 
number of servers ranges from 1 to 30 in period 1 
and from 1 to 40 in period 2. The function appears to 
follow a multidimensional extension of an S-shaped 
curve discussed in §1 (see also Ingolfsson et al. 2007, 
Atlason et al. 2004). 

To verify that the function is discrete pseudocon- 
cave, one can solve for each point, the linear (feasibil- 
ity) program 

min 0 

s.t. <j T (y - y) > 1 Vy e {y: gfy; n) > &(y; ")}, 
q>0. (8) 

Without the nonnegativity constraint, Problem (8) has 
a solution q if and only if a pseudogradient exists, 
and then q is such a pseudogradient. We added this 
constraint because the service-level function can rea- 
sonably be assumed to be nondecreasing in each com- 
ponent of y. The number of constraints in Problem (8) 
can be very large and it cannot be constructed without 
simulating the service-level function for all practical 
values of y. In practice, however, one can solve the 
LP using only the points visited by the algorithm to 
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check whether they are consistent with the assump- 
tion of pseudoconcavity. 

The results of Problem (8) for the function in Fig- 
ure 2 indicate that the function is very close to being 
discrete pseudoconcave. There are only a few points 
in the entire domain where a pseud ogradient does 
not exist. These points are depicted in Figure 2 by the 
white dots on the bottom and a single black dot (at 
(j/i/ 1/2) = 37 )) on t°P of the graph. Note that these 
points are in relatively flat areas of the graph. In the 
next section, we describe a simple way to advance the 
algorithm at such points, which we successfully used 
in the implementation of our cutting-plane method. 

Moreover, in a flat area, for a finite sample size, 
each additional call answered on time can have a 
significant effect on the order of the values of the 
service-level function. Therefore, it seems reasonable 
to assume that this situation would be alleviated for 
larger sample sizes, and in particular, to assume that 
the expected service-level function is discrete pseu- 
doconcave. Thus, to prove the convergence results 
in §4.3, we assume that the sample averages of the 
service-level functions are discrete pseudoconcave. 

4.2. A Simulation-Based Cutting-Plane Method 
for the Call Center Staffing Problem with 
Pseudoconcave Service-Level Functions 

In this subsection, we describe our modification of the 
ACCPM for Problem (3). At the beginning of a typi- 
cal iteration, we have available a feasible solution y* 
of (3), and an upper bound 2 = /(y*) on the optimal 
value of the problem. The point y* is the best feasible 
solution found by the algorithm so far. We also have 
available a polyhedron P = {y € W: y > 0, Dy > d) 
that is known to contain the feasible region. 

Suppose that y* , 2, and P are as above. If y* is not an 
optimal solution, then, because / (y) takes on integer 
values, the localization set 

{yeP: /(y) < 2- l,y integer} 

is nonempty and contains all feasible solutions with 
objective values better than 2. The localization set 
is empty precisely if y* is an optimal solution. This 
observation provides grounds for the termination cri- 
teria we specify in our algorithm. 

Computing the next iterate. First, we find the analytic 
center of a polyhedral relaxation of the localization 
set. In particular, we solve the following optimization 
problem: 

v 

min -«>log(2 — a — c T x)-Y^°Zyi 

-Elog(D,,y-^) 

s.t. Ax>y, 

-r>0, (9) 



where the constant a e (0, 1) ensures that only points 
with an objective value better than 2 are considered. 
Due to Assumption 1, the effective feasible region 
of (9) is bounded; hence the problem has an optimal 
solution as long as it has a feasible point in the effec- 
tive domain of the objective. If (9) has no solution, 
the algorithm terminates with y* as an optimal solu- 
tion; otherwise, let (y ac , r* 0 ) be a solution of (9). Here, 
w > 0 is the weight constant; we discuss later how 
this constant is determined in the algorithm. 

Note that the analytic center found in the previous 
step is almost certainly not integer, and rounding y ac 
to the nearest integer can result in a point outside of 
the localization set. To capitalize on the centrality of 
the analytic center in the localization set, we instead 
find the closest integer point in the effective feasible 
region of (9), i.e., solve 

min ||y-y"|| 
s.t. Ax>y 
c T x < 2 - 1 
Dy>d 

x, y > 0 and integer. (10) 

If (10) is infeasible, the algorithm terminates with y* 
as an optimal solution; otherwise, let (y,x) be the 
solution of (10) and choose y as the next iterate. Here, 
lly _ y ac ll is a measure of the distance between y 
and y ac . If we choose the Lj-norm as the measure, i.e., 
lly - y* c ll = ELi ly, - y? c |, then (10) is a linear integer 
program. We discuss the computational requirements 
of solving this problem at each iteration when we talk 
about the overall computational effort in relation to 
the computational experiments. 

Estimating the service levels. Next, we compute 
&(y; n ) for all i via simulation. 

Adding an optimality cut. If #,(y; n) > 0 for all i, then 
y satisfies the service-level requirements. Because 
c T x < 2 — 1, y is contained in the localization set, i.e., 
it is the best staffing level so far. Note that c T x is not 
necessarily the cost associated with staffing level y 
because c T x is not being minimized in (10). To com- 
pute the cost associated with y, we instead solve (1) 
to get /(y) and update 2 :=/(y). 

Adding a feasibility cut. If g,(y; n) < 0 for some i, 
then we add a feasibility cut for each i such that 
£, (y; h) < 0. In particular, we estimate a pseudogra- 
dient, <f (y; n), of #, (*; n) at y (we will discuss tech- 
niques for estimating the pseudogradients in §5.2). If 
<T(y; n ) # 0> we a dd a feasibility cut of the form 

?(9'>") T y><r&n) T 9+* (ID 

for some smaU constant e > 0 (we discuss the role of e 
in the discussion of the convergence of the method in 
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Figure 3 Illustration o1 the Feasible Region and an Iterate in the 
SACCPM 




§4.3). If if (y; n) = 0, the feasibility cut takes the form 
of a lower bound on the number of servers (see the 
discussion in §5.2). We update D and d to reflect the 
cuts added. 

The above procedure is then repeated. An illustra- 
tion of the localization set and the feasible regions and 
solutions of Problems (9) and (10) in each iteration is 
shown in Figure 3. The picture shows points in the 
space of the y variables in the case p = 2. The shaded 
area represents the localization set {y 6 P: /(y) < z — 1} 
(ignoring integer constraints). The squares represent 
the points that are feasible for the sample average 
approximation problem (3). The thick solid line seg- 
ments result from an optimality cut /(y) < z - 1 and 
the dotted lines represent the feasibility constraints 
Dy>d. Point y 3C is the solution of the analytic center 
problem, and y* is the closest integer in the localiza- 
tion set. The line just to the right of y k represents a 
feasibility cut that would be added in this iteration. 

We summarize the simulation-based analytic center 
cutting-plane method (SACCPM) for the call center 
staffing problem in Figure 4. (To shorten the presen- 
tation, the description of the algorithm in Figure 4 is 
only specified for the case when the constraint func- 
tions !,(•; n) are in fact discrete pseud oconcave.) 

The weight parameter w on the optimality cut can 
be increased to "push" the weighted analytic center 
away from the optimality cuts. There are no theoret- 
ical results on how to choose the weights, but some 
computational experiments have been done to test 
different values of the weights, and in fact, weight 
parameters on the feasibility cuts (Goffin et al. 1992, 
du Merle et al. 1998). The choice of the weights is 
problem and data dependent (see §5.2 for the partic- 
ular choice used in our implementation). 

In practice, it is useful to maintain a lower bound 
on the optimal value of Problem (3) throughout the 
algorithm, in addition to the upper bound z. In par- 
ticular, the algorithm can be terminated as soon as 



Figure 4 The Simulation-Based Analytic Center Cutting-Plane Method 
(SACCPM) 

Initialization Start with a solution /' feasible to Problem (3). 
Lot z° := / (if), and y := y*. Lot P° = (y > 0 : D°y > rf°), 
where D° and if are empty. Choose an «■ > 0, a € (0, 1) and 
«>°>0. LetA :=0. 

Stq> 1. Solve Problem (9) with zv = xv K , z = z k , and P = P\ 
If (9) does not have a (feasible) solution, then terminate 
with y* as the optimal solution and z k as the optimal value; 
otherwise let y*° be the solution of (9). 

Shy Z Solve Problem (10) with z = z l and P = P*. If (10) is 
infeasible, then terminate with y' as the optimal solution 
and z k as the optimal value; otherwise let y k be the 
optimal solution of (10) found. 

Step 3a. If £(y l ; h) > 0 Vi = 1 let z 141 := f(tf), xf := y*, 

D* +, :=D*,andi l+1 :=d l . 

Step 3b. If #,(y* ; n) < 0 for some i = 1 , . . . , p, then add the 
constraint if (if; n) T y > q'(y* ; n) T \f + e for each i such that 
£(/; n) <0. Update D k and d k to include the added 
inequalities, and let P K+l = {y > 0: f> +l y > d M J represent 
the new constraint set. Let z l+1 := z k . 

Step 4. Compute w k+l and let k :=* + 1. Go to Step 1. 

the gap between the upper and lower bounds is suf- 
ficiently small, indicating that the current "incum- 
bent" y* is a sufficiently good solution of the problem. 
A lower bound can be found as the optimal objective 
value of 

min /(y) 
s.t. Dy>d, 

y > 0 and integer, (12) 

or, for a weaker but easier to compute bound, of the 
linear programming relaxation of (12). 

4.3. Convergence of the SACCPM 

In this section, we give conditions under which the 
SACCPM terminates with y* as an optimal solution 
of (3). First, we argue that the algorithm does indeed 
terminate and then we show that y* is an optimal 
solution of (3) at termination. To prove the results, we 
make the following two assumptions. 

Assumption 2. The functions g, (y; n) are discrete 
pseudoconcave in y for all i = 1, . . . , p. 

Assumption 3. In the implementation of the 
SACCPM, <f (y; n) is a pseudogradient of £,(•; n) at y. 

We also define the sets 

T := {y > 0 and integer: /(y) < z 0 }, 
*:=rn{y:g,(y; n) > 0 V/ = 1, . . . , p), 
T := r\*, and 

%) := {/: £,(y;*)<o}. 

In words, Y is the set of points that are potentially 
visited by the algorithm, and it contains the set of 
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optimal solutions. The set M' is the set of points in f 
that are feasible for the sample average approxima- 
tion problem (3). The set T is the set of points in T 
that are not feasible for the sample average approxi- 
mation problem (3). The set /(y) is the set of periods 
in which the sample average of the service-level func- 
tion is not acceptable given the staffing levels y. The 
following lemma says that all solutions in ^ satisfy 
potential feasibility cuts (11) for some appropriately 
chosen e. 

Lemma 1. Let ^(y; n) be a pseudogradiait of £(•; n) 
at y and suppose that Assumptions 1 and 2 hold. Thai, 
there exists an i > 0 such that <f (y; «) T (y - y) > £ 
Vy€*,y€T, /€/(y). 

Proof. Let y e T be fixed. Suppose that <f(y; n) T • 
(y - y) < 0 for some y € * and i € /(y). Then, 
|,Cv; w) < gi(9'f ") < °, where the first inequality fol- 
lows by Assumption 2 and the second inequality fol- 
lows because i e J(y). This is a contradiction because 
y € V, and therefore rp(y; n) T (y - y) > 0 Vy e ^ 

Let e(y) = min ie/(P) min y€ ^ <f (y; «) r (y-y). The set ^ 
is finite by Assumption 1; therefore, the inner mini- 
mum is attained for some y eV. Because /(y) is also 
finite, the outer minimum is also attained for some 
i € J(y). Therefore, €(y) > 0. 

Finally, let £ = min^ €T e(y). Then, £ > 0 because T 
is finite. □ 

Theorem 1. Suppose that (3) has an optimal solution 
and that Assumptions 1, 2, and 3 hold. Furthermore, let 
0 < e < i, where i is as in Lemma 1. 77?ew, f/ie SACCPM 
terminates in a finite number of iterations returning y\ 
which is an optimal solution of (3). 

Proof. The SACCPM only visits points that are fea- 
sible solutions of (10). The set of feasible solutions of 
(10) at every iteration is a subset of T and therefore is 
finite by Assumption 1. Suppose that the Jrth iterate 
y k is the same as a previous iterate yi for some ; < k. 
If y> is in then f(y k ) < z - 1 < f(yi) - 1, where the 
second inequality follows because z is the cost of the 
best feasible solution visited before iteration Jt. This is 
a contradiction, therefore y> is not in V. On the other 
hand, if y' is in T, then £ (y>; n) < 0 for some i e /(y>). 
Because y k is in F, <p(y>; n) T (y k - yi) > e > 0, which is 
also a contradiction and y k can therefore not be equal 
to yi for any ; < k. Therefore, the SACCPM does not 
visit any point in T more than once. Because T is a 
finite set, the algorithm is finite. 

By Lemma 1, the feasibility cuts never cut off any 
of the feasible solutions of (3). At termination, Prob- 
lem (9) and/or Problem (10) does not have a feasible 
solution, which means that all the feasible solutions 
for (3) have an objective value greater than 2-1. But 
y* is feasible for (3) and has an objective value of z 



and is, therefore, optimal, because? f(y) is integer by 
Assumption 1. □ 

The theorem says that there exists an £ such that the 
algorithm terminates with an optimal solution of (3) 
if 0 < e < £. In practice, the value of £ is unknown, but 
in general, a ''small" value for e should be chosen as 
in §5.4. 

5. Numerical Results 

In this section, we give an implementation of the 
SACCPM for a call center with time-varying arrival 
rates. In §5.1, we describe the example that we use 
for the numerical experiments. In §5.2, we discuss 
how to compute, or estimate, the pseudogradients, 
and we also describe how the algorithm was imple- 
mented. We compare the results with staffing levels 
obtained by analytical queueing methods (see §1 for 
references). Analytical queueing methods based on 
the Erlang C formula (14) are widely used to deter- 
mine staffing levels in call centers (Cleveland and 
Mayben 1997). In Green et al. (2001, 2003), several 
heuristics for improving the performance of the basic 
Erlang C model are evaluated and we give a summary 
of these heuristics in §5.3. We believe that the meth- 
ods in Green et al. (2001, 2003) are among the best 
analytical methods for determining required staffing 
levels to answer a minimum fraction of calls before 
a threshold time limit. Therefore, we use these meth- 
ods as benchmarks for our method. The results of our 
experiments are in §5.4 and we comment on the com- 
putational requirements of the SACCPM in §5.5. 

5.1. Example 

Our test model is similar to the models used in Green 
et al. (2001, 2003), which are call centers that can 
be modeled as M(t)/M/s(t) queueing systems. In 
the Green et al. (2001) paper, a call center operating 
24 hours a day and seven days a week is studied, 
while the subject of the Green et al. (2003) paper is 
call centers with limited hours of operation. We con- 
sider a call center with limited hours of operation that 
is open from 6 a.m. to 12 a.m. 

The call center has the following additional char- 
acteristics. The planning horizon consists of a sin- 
gle day's operation. The 18-hour planning horizon is 
broken into 72 time periods, each of length 15 min- 
utes. In each period, 80% of calls should be answered 
immediately. This is equivalent to setting t = 0 and 
7r = 0.8. The service times are independent exponen- 
tial random variables with rate /x; see Table 1. The 
size of the system is measured by average load R, 
which is the average arrival rate divided by the ser- 
vice rate. The arrival process on any given day is 
a nonhomogeneous Poisson process with a continu- 
ous, piecewise-linear arrival rate defined at the end of 
period i by A, = A(l + flsin^Trfo - 6)/18)), where f f is 
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Table 1 The Parameter Settings in the Experiments 



Parameter 


Low 


High 




4 calls/hour 


16 calls/hour 


R 


8 


32 


e 


0.25 


0.75 


Shifts 


Yes 


No 



the end time of period i and is measured in the hour 
of the day (e.g., f 0 = 6 and f 72 = 24). The average daily 
arrival rate is A = R/i and 0 is the relative amplitude. 
Calls are answered in the order they are received. 
When there is a reduction in the number of servers at 
the end of a period, outgoing servers complete calls 
that they are already servicing before ending their 
shifts. A new call cannot enter service until there are 
fewer calls in service than there are servers for the 
new period. 

We study the performance both in the presence of 
shift constraints and when there are no shift con- 
straints. When there are no shift constraints, the tour 
matrix A is the identity matrix in W, and the cost for 
each tour is equal to one agent-period, i.e., the total 
staffing cost equals the total number of servers over 
all 72 periods. We assume that the shift constraints 
are such that each tour covers six hours, or 24 peri- 
ods. The tours can only start on the hour and no later 
than six hours before the end of the day. This results 
in 13 tours: 6 A.M.-12 p.m., 7 a.m.-I p.m.,..., 5 p.m.- 
11 p.m., and 6 p.m-12 a.m. There are no meal-type 
breaks incorporated in the shifts, but they can easily 
be included by changing the appropriate values in the 
tour matrix A The cost for each tour is equal to 24 
agent-periods per tour. 

Note that we are yet to specify a value for ft, R, and 
0. In the experiments, we study how the SACCPM 
performs under two different settings (high and low) 
of each of these parameters. Along with the two set- 
tings for the shift constraints, this results in a total 
of 2 4 = 16 experiments. The high and low values for 
each parameter are given in Table 1. Green et al. 
(2001, 2003) additionally studied the effect of the tar- 
get probability, 7r. The target level did not appear 
to have as significant an effect on the reliability of 
each method as the other parameters did, therefore 
we do not include different settings of it in our study. 
Instead of shifts, they included a factor that they call 
the "planning period" which is similar to the shift fac- 
tor in that the number of servers is constant in each 
planning "period" (which can be longer than what 
we, and they, call a period and measure the perfor- 
mance in). 

5.2. Estimating the Pseudogradients and 
Implementing the SACCPM 

Before we describe the implementation of the con- 
tinuous and discrete optimization problems solved in 



each iteration of the SACCPM, we discuss what is per- 
haps the most challenging part of the implementation 
of the algorithm: estimating the pseudogradients. Up 
until now, we have assumed that in the implemen- 
tation of the SACCPM, <f (y; n) is a pseud ogradient 
of gii'in) at y. What makes computing (or estimat- 
ing, because it may prove difficult to ensure that the 
resulting vector satisfies condition (7) exactly) such 
a pseudogradient particularly challenging is that the 
service-level function is a discrete function of the 
number of agents. In addition, we do not have a 
closed-form expression of the function. 

The simplest and perhaps the most intuitive 
method for estimating a gradient (or pseudogradient) 
when an expression for the function is unknown is 
the method of finite differences (see, e.g., Andradottir 
1998). The method of finite differences can easily be 
extended to discrete functions, i.e., we let 

^(y;»)=ft(y+^;n)-ft(^;fi) vj = i,..., P , (13) 

where e } is the ;th unit vector in W, be the esti- 
mate of the pseudogradient of £(y; n) at the staffing 
level y. It is clear that p + 1 function evaluations 
of g i are required to compute q*. We note, however, 
that estimates of the pseudogradients of the service- 
level functions g t in all p periods, i — 1, . . . , p, can be 
obtained from those same p + 1 simulations. 

The number of periods p can be quite large in 
staffing applications. Therefore, if we could estimate a 
pseudogradient from a single simulation, the compu- 
tation time of the SACCPM would be greatly reduced. 
Atlason (2004) includes, however, an implementa- 
tion of the SACCPM using infinitesimal perturbation 
analysis to estimate pseudogradients, and the solu- 
tions from that algorithm are inferior to solutions 
that the algorithm gives when finite differences are 
used. Pseudogradient estimates based on the likeli- 
hood ratio gradient estimation method have also not 
been reliable (Atlason 2004, Atlason et al. 2003). 

At low staffing levels, there are so few servers, and 
calls are so backed up, that no calls are answered 
on time, and adding a server does little to allevi- 
ate the situation; cf. low values of y x and y 2 in Fig- 
ure 2 It is certainly possible that we would encounter 
such a staffing level in the cutting-plane algorithm for 
the sample average approximation of the service-level 
function; an attempt to compute or estimate a pseu- 
dogradient at this point would produce a zero vector. 
Therefore, as a feasibility cut at such a point y, we 
impose a lower bound on the number of servers in the 
period i, ie., add the constraint y, > y, -f 1. This is not 
necessarily a valid feasibility cut because the reason 
for calls backing up might be understaffing in previ- 
ous periods. In our implementation, if such a cut is 
added during the algorithm, we verify at termination 
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that the corresponding constraint is not tight at the 
optimal solution found. If it is tight, then we lower 
the bound and do more iterations of the cutting-plane 
method. 

Other parts of the SACCPN4 were implemented as 
follows. We used a sample size of n = 100. We built a 
simulation model using the ProModel simulation soft- 
ware to compute the sample average of the service- 
level function. We used Visual Basic for Applications 
and Microsoft Excel to store the data and to com- 
pute the cuts. We used the AMPL modeling language 
to model, and call a solver for, the analytic center 
problem (9) and the IP (10) of finding an integer 
point close to the analytic center. We used the MINOS 
solver to solve the analytic center problem (9). We 
used the CPLEX solver to solve the IP (10). We used 
the Excel solver to compute the cost /(y) of a partic- 
ular staffing level y. 

Finally, we describe the settings of the parameters 
that are specific to the SACCPM. We let w k = r, where 
r is the number of feasibility cuts that have been 
added in iterations 1 through jt — 1 (we let w k = 1 
if no feasibility cuts have been added). We used e = 
10~ 5 and chose a = 1 — e. Instead of specifying a par- 
ticular feasible starting point y°, we started with an 
upper bound on the staffing levels in each period to 
ensure boundedness of the localization set. For the 
parameters chosen in the experiment, it is unlikely 
that more than 130 agents will be allocated in any 
period in an optimal solution. Hence, we added the 
term - £? =1 log(130 - y,) to the objective of the ana- 
lytic center problem (9) and the constraints y, < 130 
V/ = l,...,pto the IP (10). 

5.3. Analytical Queueing Methods 

In this section, we describe the queueing methods in 
Green et ah (2001, 2003) that we use as benchmarks 
for our computational study. Traditional queueing 
methods for staffing call centers assume that the pro- 
cess is in steady state, i.e., the arrival rate is fixed and 
the call center has been open long enough, so that the 
initial state of the call center does not matter. Then, 
assuming that the call center can be modeled as an 
M/M/s queueing system, the probability that a cus- 
tomer must wait more than t time units, as a function 
of the number of servers s, is given by (Cooper 1981, 
pp. 91, 97) 

P(s) = C(s,R)c-^-» r fors>K, (14) 

where R = X//jl is the offered load. If $ < R, then 
P(s) = 1. The term C(s, R) is the probability that an 
incoming call is delayed, also called the "Erlang-C 
probability of delay." It can be computed in a numer- 
ically stable way via the recursion (Cooper 1981, 
pp. 82, 92) 

B(0, R) = l, 



When the arrival rate changes between periods or 
within periods, Green et al. (2001) proposed adjust- 
ing the value of the arrival rate A used in (14). This is 
a straightforward method that can give good staffing 
levels, at least for a call center operation that is similar 
to the M(t)/M/s(t) model. They consider six differ- 
ent adjustments of the arrival rate. The six different 
schemes are given in Table 2. In Green et al. (2003), 
only SIPPavg, LAGavg, and LAGmax are considered. 
When we computed the arrival rate to use for the 
LAG methods in the first period, we assumed that the 
arrival rate prior to time zero was equal to the arrival 
rate at the beginning of the first period. 

The required staffing, y f , in period i, is computed 
by letting A = A f in (14), and letting 

y, = min{s > 0 and integer: P(s) < 1 - n}. 

The cost of the resulting staffing level is f (y). 

5.4. Results 

The parameter settings and the cost of the staffing lev- 
els obtained by each method in each of the 16 exper- 
iments are shown in Table 3. The table also shows 
the cost savings of the SACCPM versus the LAGavg 
method. We chose the LAGavg method for compar- 
ison because it performs best of the six analytical 
methods. The comparison only includes the feasible 
solution determined by the criteria described below. 

Determining feasibility. The solutions obtained by 
the six queueing methods and the SACCPM are not 
guaranteed to be feasible for the call center staffing 

Table 2 Different Methods for Adjusting the Arrival Rate to Use 
in Equation (14) 

Method a, 

SIPPavg — !— x(t)dt 

SIPPmax max A(/) 

SlPPmix If A(f) is nondecreasing in period /'. use SIPPavg rate, 
otherwise use SIPPmax rate. 

LAGavg i—r-f Affltff 

LAGmax max A(f) 

LAGmix If k{t) is nondecreasing in [t,_ y - 1 t, - 1 /pj, use 
LAGavg rate, otherwise use LAGmax rate. 

Note. Here, f, is the time when period /' ends {t 0 = 0). 1/p is the mean 
service time, and a, is the rate to be used to determine the staffing 
in period /. 
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Table 3 Cost of the Solutions in Agent-Periods 



Experiment n R 0 Shifts 



oAOUrM 


SIPPavg 


SIPPmax 


SIPPmix 


LAGavg 


LAGmax 


LAGmix 


Savings (%) 


1.008 


1.056 


1 fKR 


l.UDD 


1 nee 
1.05b 


1.056 


1.056 


4.6 


1.032 


1.056 


1 n^R 


i.UOO 


lQoZ 


1.056 


1.032 


0.0 


3,456 


3552 


O.U£*t 


0,3/0 


O ACC 

0,456 


3,552 


3.552 


0.0 


3,504 


3,552 


0,G£*4 


O.O/O 


0,0/0 


3,576 


3.528 


2.0 
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1 4 8 0.75 Y 32 

2 16 8 0.75 Y 126 

3 4 32 0.75 Y 128 

4 16 32 0.75 Y 512 

5 4 8 0.25 Y 32 

6 16 8 0.25 Y 128 

7 4 32 0.25 Y 128 

8 16 32 0.25 Y 512 

9 4 8 0.75 N 32 

10 16 8 0.75 N 128 

11 4 32 0.75 N 128 

12 16 32 0.75 N 512 

13 4 8 0.25 N 32 

14 16 8 0.25 N 128 

15 4 32 0.25 N 128 

16 16 32 0.25 N 512 
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the cost savings of using the SACCPM versus the LAGavg method. 



problem with expected service-level constraints (2) 
(although it is true that a solution obtained by the 
SACCPM is feasible for the respective sample aver- 
age approximation problem). To determine feasibil- 
ity, we increased the sample size to n = 999 (the 
maximum in ProModel) and simulated the call cen- 
ter using each of the solutions found, at the staffing 
level vector Ax. We simulated at Ax rather than y 
because the ith component of the vector Ax is the 
actual number of agents available in period /" and can 
be greater than y t because of slack in the shift con- 
straint Ax>y in (1). Because the simulation output 
of the 999 experiments is still random, we decided 
to declare the service-level requirements not met if 
the fraction of calls answered immediately is less than 
75% in any period instead of the n = 80% from the 
example and was used in the implementation of the 
SACCPM algorithm. This is similar to what Green 
et al. (2001) used to determine feasibility. They com- 
puted the service level using numerical integration of 
transient queueing formulas and said that the service 
level is infeasible in a period if the target probabil- 
ity is exceeded by at least 10%. We chose 75% as our 
threshold because the largest 95% confidence interval 
half-width in our simulations using n = 999 was 4.4% 
and 80% - 4.4% % 75%. 

The feasibility of the solutions is reported in 
Table 4. We first note that all the methods do well in 
most cases. The SIPPavg method struggles when there 
are no shifts present and the service rates are low. 
In this case, there is significant dependence between 
time periods, which none of the SIPP methods take 
into account. We also note that the solutions from 
the SACCPM do not always meet the service-level 
requirements by the 75% criterion, and in many peri- 
ods fail to meet the 80% requirements. This is because 



the sample size was only 100 when the solution was 
computed, therefore there Is a significant error in the 
estimates of the service-level functions. 

Ranking the methods. In Table 3, we put in boldface 
text the cost of the method that we declared a "win- 
ner." We selected a winner based on two criteria. The 
first criterion is that the service level must be greater 
than 75% in all periods based on the simulation using 
sample size 999. The second criterion is cost. We see 
that the SACCPM is a winner in all but three experi- 
ments. In two of these, the solutions fail to meet the 
feasibility criterion. In the third case, the cost of the 
best solution was only 0.3% lower than that obtained 
by SACCPM. In the case of ties, it would make more 
sense to use one of the analytical queueing heuristics 
because they are much easier to implement. However, 
one would not know beforehand whether a tie would 
occur, and which heuristic to use. 

The analytical queueing heuristics have been 
shown to do well on this particular type of queue- 
ing model. In the SACCPM, however, no assumptions 
are made on the arrival process or the service times, 
therefore it may apply in even more general settings. 

In §5.2, we noted that the finite difference estima- 
tor of a pseudogradient for low staffing levels can 
be zero. In one or more iterations in some of the 
experiments, the finite difference method produced a 
zero vector, which could not be used to generate fea- 
sibility cuts. Instead, we imposed the lower bounds 
described in §5.2 and then checked upon termination 
of the algorithm whether these lower bounds were 
tight. The bounds were tight in experiment 16, there- 
fore we relaxed the bounds and ran the algorithm 
until it terminated again, this time with a solution 
where these kinds of bounds were not tight. 
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Table 4 Feasibility ot the Solutions and Number of Iterations ot the SACCPM 
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SACCPM 


Olr rdvU 
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C 1 D Dmiu 

OlrrmiX 




LAbmax 


LAG mix 


Iterations 


Feas. cuts 


Opt. cuts 


1 
2 


2 
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1 

79.0% 


1 

79.0% 


2 

79.0% 


81.3% 


84.4% 


85.9% 


I / 


16 


0 


82.1% 


83.1% 


83.1% 


83.1% 


82.1% 


83.1% 


82.1% 


Of) 


or 
6o 


n 
9 


3 


76.7% 


74.4% 


81.6% 


81.5% 


82.1% 


85.9% 


82.5% 


10 


77 


3 


4 


82.1% 


82.0% 


84.3% 


83.1% 


82.0% 


86.2% 


82.1% 


1 D 


9/ 


4 


5 
6 


82.4% 


83.1% 


81.3% 


82.5% 


81.8% 


83.3% 


83.1% 


39 


34 


16 


81.8% 


81.8% 


81.8% 


81.8% 


81.8% 


81.8% 


81.8% 


39 


115 


14 


7 


80.1% 


80.4% 


82.5% 


80.8% 


B3.5% 


83.8% 


83.8% 


12 


123 


3 


8 


3 

76.6% 


82.0% 


82.5% 


83.0% 


81.6% 


82.3% 


81.6% 


11 


119 


3 


9 


15(3) 

74.2% 


14(3) 

73.2% 


3 

76.2% 


3 

76.2% 


2 

76.5% 


81.2% 


1 

79.8% 


47 


210 


6 


10 


12 

77.8% 


3 

78.4% 


80.8% 


80.8% 


2 

79.9% 


81.1% 


2 

79.9% 


24 


173 


7 


11 


21 (3) 

74.7% 


33 (23) 

61.2% 


29(4) 

71.5% 


29(4) 

71.5% 


4 

78.7% 


80.8% 


80.7% 


38 


374 


4 


12 


5 

77.9% 


11 

76.0% 


80.8% 


80.8% 


1 

79.7% 


81.9% 


80.2% 


26 


271 


5 


13 


7 

76.1% 


1 

79.9% 


80.5% 


80.5% 


1 

80.0% 


1 

80.0% 


1 

80.0% 


51 


273 


8 


14 


6 

75.9% 


3 

79.0% 


1 

79.8% 


2 

79.6% 


2 

79.6% 


1 

79.8% 


2 

79.6% 


27 


290 


7 


15 


17 

75.1% 


19 

76.7% 


8 

78.2% 


8 

78.2% 


3 

79.0% 


80.3% 


2 

79.0% 


32 


294 


4 


16 


10 

76.6% 


2 

79.1% 


81.1% 


81.1% 


1 

79.7% 


81.1% 


81.1% 


53 


376 


7 



Notes. Each cell has up to three values. The first value is the number of periods in which the fraction of calls answered immediately is less than 80%. The 
boldface values in parentheses are the number of periods in which the fraction of calls answered immediately is less than 75%. The percentages show the 
lowest fraction of calls answered immediately in any period. We do not display the first two values when they are equal to zero. The last three columns show 
the number of iterations of the SACCPM and the number of feasibility and optimality cuts added. 



5.5. Computational Requirements 

We can divide each iteration of the algorithm into 
three main parts in terms of computations: 

1. Solve the analytic center problem (9): This usually 
took less than one second using the MINOS solver 
and never took more than three seconds. 

2. Solve the IP (10) to get an integer solution close to 
the analytic center: In the initial stages of the algorithm, 
this takes on the order of seconds to solve. However, 
when there were no shifts, meaning that the solution 
space for y is larger, it could take millions of branch- 
and-bound nodes to find an optimal solution after a 
number of cuts were added. Because it is not nec- 
essary to find an optimal solution of the IP (10) to 
advance the algorithm, we often terminated the IP 
with a suboptimal, but feasible, solution to determine 
the next iterate. If the IP seemed infeasible, but the 
infeasibility was difficult to prove and the optimality 



gap in the SACCPM was less than 1%, the SACCPM 
was terminated with a possibly suboptimal solution. 

Solving the IPs was much easier when the shift con- 
straints were included, which is the scenario one usu- 
ally faces in practice. In that case, an optimal solution 
of the IP was found, or the IP was deemed infeasible, 
in a matter of seconds. 

3. Simulate to estimate the expected service level and 
gradients: Simulating at each staffing level in Pro- 
Model took from six seconds to about one minute, 
depending on the number of arrivals to the system, 
on a Pentium 4 3 GHz computer. Up to 73 such sim- 
ulations are required per iteration, although we did 
not always compute all 72 differences to compute 
the FD pseudogradient. It appeared that dependence 
between time periods was not a factor over more than 
10 periods, so as a rule of thumb, we only computed 
the differences for the 10 periods preceding period i, 
and for period / if the service-level constraint was 
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not satisfied in period /. Staffing levels in subsequent 
periods do not have any effect on the service level in 
period / because the requirement is to answer 8U% of 
the calls immediately. 

Hence, estimating the pseud ogradients and solving 
the IP (10) requires the most computation; see §6 for 
ideas on how the computational requirements can be 
reduced. 

The number of iterations required is given in 
Table 4. In the initial stages, the SACCPM sometimes 
produced several optimality cuts before any feasibil- 
ity cuts were generated. Because the weight on the 
optimality cuts is only one in the beginning, the opti- 
mality cuts will be fairly close to each other in such 
a case, and progress will be slow. This happens when 
the starting point has much higher staffing levels than 
what is really needed. When this occurred, we added 
deeper optimality cuts until the first feasibility cuts 
were generated by the algorithm, i.e., we lowered z. 
One could start the algorithm close to the solutions of 
the analytical queueing heuristics, to hopefully speed 
up the convergence. We did not try to implement this 
approach. 

6. Conclusions and Future Research 

The SACCPM presented in this paper is a promis- 
ing method for computing optimal staffing levels in 
a call center when traditional methods fail. Although 
the computational requirements of our method are 
large, we were able to solve, or at least approximately 
solve, a number of moderately sized hypothetical call 
center staffing problems. The results of the SACCPM 
were encouraging and they show that the method can 
potentially be applied in real-world situations with 
good results. Furthermore, the algorithm can be auto- 
mated, so that no user intervention is required while 
the algorithm is running. 

The algorithm is robust in the sense that it works 
well on a range of different problems which gives 
it further credibility over the traditional methods. 
Compared to the traditional queueing methods, the 
SACCPM does best in the presence of shift con- 
straints. To see why, recall that the SACCPM solves 
both the problem of determining minimum staffing 
levels and the problem of computing the best assign- 
ments to cover these staffing levels simultaneously, 
while the queueing methods compute the required 
staffing levels first and the shift assignments after- 
wards, using the staffing levels as input: 

There are several interesting directions for related 
future research, some of which we are actively pur- 
suing. From the numerical experiments, we identified 
that both the simulations and solving the integer pro- 
grams can be computationally expensive. It would be 



beneficial to study more efficient methods for estimat- 
ing the pseudogradients that could mimic the perfor- 
mance of the finite difference method. In relation to 
the integer programs, one should investigate integer 
programming algorithms that can utilize the special 
structure of the relaxed problems solved in each iter- 
ation and consider allowing approximate solutions of 
the IPs, especially in early stages of the algorithms. 
Another technique often used in the continuous case 
to speed up the computation of the next iterate is 
to drop cuts that are redundant (see, e.g., Mitchell 
2003 for more on this approach), and it is quite pos- 
sible that the same technique could reduce the time 
required to solve the IPs in the later stages of the 
SACCPM. One could also generate some initial cuts 
by running simulations at the staffing levels sug- 
gested by heuristics. 

It is conceivable that other optimization methods 
could perform well in this setting. The extended 
cutting-plane method in Westerlund and Porn (2002) 
seems to fit the framework particularly well, although 
some details of the implementation are unclear. 

The problems solved in this paper were fairly sim- 
ple instances of a call center staffing problem, but 
because no assumptions are made on the arrival and 
service processes and simulation is used to evaluate 
performance, it seems that the method would also 
apply in more complicated settings. Call abandon- 
ments, skill-based routing, and prioritizing multiple 
customer classes are problems that call center man- 
agers commonly face, and it would be interesting to 
incorporate those in the algorithm. Moreover, it is 
likely that the SACCPM could, with appropriate mod- 
ifications, be applied to problems other than call cen- 
ter staffing. 
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tSZZ ^ I!?*" 1 " CUtti ° 8 PlaDe meth ° d fcr minimi2in 8 ^"8 costs in a service system 

subject to saustymg acceptable service level requirements over multiple time periods. We assume that the 
service level cannot be easily computed, and instead is evaluated using simulation. The simulation uses the 

'T 10 ? T*° m T^T S ° ** *** Sequence of random Phenomena is observed when 
evaluating different staffing plans. In other words, we solve a sample average approximation problem. We 
esubhsh convergence ot the cutting plane method on a given sample average approximation. We also es- 
tabhsh both convergence, and the rate of convergence, of the solutions to the sample average approximation 
o solutions ot the ongtnal prob.em as the sample size increases. The cutting plane method redes on the 
serv.ce level tunctions be.ng concave in the number of servers. We show how to verify this requirement as 
our algorithm proceeds. A numerical example showcases the properties of our method, and sheds light on 
when the concavity requirement can be expected to hold. 

Keywords: simulation optimization, call centers, cutting planes, sample average approximation 
1. Introduction 

In this paper we present the theoretical properties of a cutting plane method for mini- 
mizing staffing costs in a service system subject to satisfying acceptable service level 
requirements over multiple time periods. This method was proposed by Hendeison and 
Mason (1998) and combines simulation and integer programming in an iterative cutting 
plane algorithm. Simulation is a powerful method for analyzing complex systems but 
optimization with simulation can be difficult. Linear integer programming problems 
atong with many other mathematical programming models, are well studied and many 
algorithms have been developed for solving problems in this form, but a simplification 
ot the system is often required for modelling. Our iterative cutting-plane algorithm com- 
bines simulation and linear (integer) programming to solve resource allocation problems 
where the objective function, or constraints, or both, are evaluated via simulation The 
algorithm relies on the concavity of the problem constraints, but in our algorithm we 
have a built-in robustness, so that nonconcavity can be detected. 

* Corresponding author. 
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The method of combining simulation and optimization in this way has potential 
applications in various service systems, such as call center staffing (which will be the 
focus of this paper) and emergency vehicle dispatching (which we are currently investi- 
gating). In fact, the method could potentially, with appropriate modifications, be utilized 
in many other areas where simulation is an appropriate modelling tool. 

The problem of determining optimal staffing levels in a call center (see, e.g., 
Thompson (1997)) is a motivating example for our work. The decision maker' faces 
the task of creating a collection of tours (work schedules) for the call center of low cost 
that together ensure a satisfactory service level. A tour is comprised of several shifts 
and has to observe several restrictions related to labor contracts, management policies, 
etc. We divide the planning horizon (typically a day or a week) into small periods 
(15-60 minutes) and focus on the service level in each period. We define the service 
level in a given period as the fraction of calls received in that period answered within 
a specified time limit. In this paper we focus on the problem of minimizing cost while 
satisfying the service level and scheduling constraints. 

The traditional method for solving this problem involves two steps. First, the re- 
quired staffing level in each period is estimated, independently period by period, often 
using i.e. queueing theory. Second, an integer program is solved to determine how many 
workers should be assigned to each of the tours in order to "cover" the previously as- 
signed staffing levels. Our method combines these two steps and allows for dependence 
between periods. There are examples in the literature (Green, Kolesar, and Soares, 2001; 
Ingolfsson, Haque, and Urnnikov, 2002; Jennings et al., 1996) that show that significant 
cost savings can be obtained by doing so, or that a staffing level obtained by assuming 
independence does not meet performance criteria when there is, in fact, dependence be- 
tween periods. Indeed, we present an example at the end of the paper showing that the 
staffing level in one period can have a considerable effect on the service level in another 
period. Green, Kolesar, and Soares (2001) and Jennings et al. (1996) suggest a relatively 
simple method for determining the required staffing levels that accounts for such depen- 
dence. Their method is based on infinite server queuing models, but requires that the 
call center can be accurately modelled as a G(t)/G(t)/s(t) queuing system. 

The cost function is usually relatively straightforward to calculate. We can calcu- 
late the cost of each tour (salary costs, appeal to employees, etc.), and multiply by the 
number of employees working each tour to get the overall cost. The service level, on the 
other hand, can be very difficult to obtain. Queuing models can be used for simple prob- 
lems, but simulation must be used to accurately model complex systems. The difficulty 
with using simulation is the large number of possible solutions since it is impractical 
to evaluate all of them. By using integer programming, we hope that we only need to 
simulate a small portion of the solution set. 

Simulation has been widely used to analyze the impact of different staffing lev- 
els on service levels and commercial simulation packages, specially designed for call 
centers, are available. Integer programming has also been used, in which case the staff 
requirements in each period are usually needed as an input in the model (see Mehrotra, 
Murphy, and Trick (2000)). 
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We present a cutting plane method based on the one developed by Kelley, Jr. 
(1960). The method solves a linear (integer) program to obtain the staffing levels, and 
the solution is used as an input for a simulation to calculate the service level. If the 
service level is unsatisfactory, we add a constraint to the linear program and go to the 
next iteration. 

Kelley's cutting plane method applies to minimization problems where both the 
objective function and feasible region (of the continuous relaxation of the integer prob- 
lem) need to be convex. The costs in the call center problem are linear and we will 
assume that the service level function is concave, so that (see equation (1», the feasi- 
ble region, relaxing the integer restriction, is convex. Since the service level function 
is unknown beforehand, we need to incorporate a mechanism into the method to verify 
that the concavity assumption holds. In section 5 we present a numerical method for 
checking concavity of a function, when the function values and possibly gradients are 
only known at a finite number of points. 

Morito et al. (1999) use simulation in a cutting-plane algorithm to solve a logis- 
tic system design problem at the Japanese Postal Service. Their problem is to decide 
where to sort mail provided that some post offices have automatic sorting machines but 
an increase in transportation cost and handling is expected when the sorting is more cen- 
tralized. The algorithm proved to be effective for this particular problem and found an 
optimal solution in only 3 iterations where the number of possible patterns (where to 
sort mail for each office) was 2 30 . Their discussion of the algorithm is ad hoc, and they 
do not discuss its convergence properties. 

Ingolfsson, Haque, and Umnikov (2002) present an algorithm for solving a call 
center staffing problem that uses a genetic algorithm for the optimization component 
and numerical solution of differential equations for evaluating the service level. Ingolf- 
sson and Cabral (2002) have developed a cutting plane algorithm for this problem using 
queuing models instead of simulation to calculate the service levels. The cuts are gen- 
erated using a heuristic, based on approximating the service level in each period as a 
function of the staffing level in that period, and may not be valid, although examples 
suggest good performance. 

Cutting plane methods have been successfully used to solve two stage stochastic 
linear programs. In many applications the sample space becomes so large that one must 
revert to sampling to get a solution (Birge and Louveaux, 1997: Infanger, 1994). The 
general cutting plane algorithm for two stage stochastic programming is known as the 
L-shaped method (van Slyke and Wets, 1969) and is based on Benders decomposition 
(Benders, 1962). Stochastic decomposition (Higle and Sen, 1991) for solving the two 
stage stochastic linear program starts with a small sample size, which is increased as the 
algorithm progresses and gets closer to a good solution. Stochastic decomposition could 
also be applied in our setting, but that is not within the scope of this paper. 

The random nature of our problem and the absence of an algebraic form for the 
service level function makes the optimization challenging. We use sampling to get an 
estimate of the service level function, and optimize the sample average approximation. 
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An important question is whether the solution to the sample average approximation con- 
verges to a solution to the original problem, and if so, how fast. 

We apply the strong law of laige numbers to prove conditions for almost sure con- 
vergence and apply a result due to Dai, Chen, and Birge (2000) to prove an exponential 
rate of convergence of the optimal solutions as the sample size increases. Vogel (1994) 
proved almost sure convergence in a similar setting, but we include proofs for reasons 
listed in section 4. 1. Shapiro and Homem-de-Mello (2000) established conditions for an 
exponential rate of convergence of the probability that the solution to the sample aver- 
age approximation is exactly the solution to the original problem in the case of a discrete 
distribution and Vogel (1988) proved a polynomial rate of convergence in a similar set- 
ting, but under weaker conditions than we require. The optimization of sample average 
approximations has also been studied in the simulation context (Chen and Schmeiser, 
2001; Healy and Schruben, 1991; Robinson, 1996: Rubenstein and Shapiro, 1993). 

The main contribution of this paper is to demonstrate the potential of bringing 
simulation and traditional optimization methods together. We establish the properties 
of a new method for solving call center staffing problems. The method is carefully 
developed because we believe that the same idea can be applied to resource allocation 
problems other than staffing problems, as previously mentioned. In addition, we present 
a numerical method for checking the concavity of a function when the function value 
and possibly gradient is only known at a finite number of points. 

The computing requirements of the algorithm presented here, as applied to 
realistically-sized problems, are rather large. Indeed, it is often the case that the cov- 
ering integer programs alluded to earlier (with predetermined staffing levels in each pe- 
riod) are difficult to solve, so that iterating such a step with simulation appears to be a 
rather formidable computational task. We view this work as a first step in the process of 
integrating the steps of determining work requirements and covering the work require- 
ments with tours. Subsequent work will focus on exploring methods for making the 
approach computationally feasible. We have many ideas for how this could be achieved; 
see section 7 for more comments on this issue. 

The paper is organized as follows. We formulate the call center staffing problem 
in section 2. We present the cutting plane algorithm and its convergence properties in 
section 3. The convergence and the rate of convergence of the solutions of the sample 
average approximation to solutions of the original problem are proved in section 4. The 
numerical method for checking concavity is described in section 5 and an implementa- 
tion of the overall method is described in section 6. Conclusions and considerations for 
future research are given in section 7. 



2. Call center staffing 

In this section we formulate and discuss in more detail the call center staffing problem 
of minimizing cost subject to service level constraints. 
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2. L Formulation and notation 

The management of a call center needs some criteria to follow when they decide on a 
set of staffing levels. It is not unusual in practice to determine the staffing levels from a 
service level perspective. In an emergency call center, for example, it might be required 
that 90% of received calls should be answered within 10 seconds. 

We introduce terminology and notation before we formulate the problem. The 
set of permissible tours (predefined work schedules over the planning horizon) can be 
conveniently set up in a matrix (see Dantzig (1954)). More specifically we have 




1, if period i is included in tour y\ 
0, otherwise. 



From the above we see that a column in A represents a feasible tour and a row in A 
represents a specific period. We let p be the total number of periods and m be the number 
of feasible tours. If we let x e R m be a vector where the j th component represents the 
number of employees that work tour j 9 then Ax = y e R p is a vector where the /th 
component of y corresponds to the number of employees that are working in period i. 
We let c be the cost vector, where d is the cost per employee working tour j. 

Next we define the service level constraints. We let / e R p be the vector whose i th 
component is the minimum acceptable service level in period / , for example, 90%. Since, 
for example, the arrival and service times of customers are not known but are random, the 
service level in each period will be a random variable. Let f , a random vector, denote all 

the random quantities in the problem and let f " denote independent realizations 

off. Let N l (?) be the number of calls received in period i and let S'(y, f ) be the number 
of those calls answered within a pre-specified time limit, for example, 10 seconds, based 
on the staffing level y. The fraction of customers receiving adequate service in period i 
in the long run is then 

lim J 'fog'> = ^co^'ELi^Cy,^) 

£2=i Hm^con-^Li^^) ■ 

If E[N l (%)] < <x) then the strong law of large numbers can be applied separately to 
both the numerator and denominator of this expression, and then the desired long- 
run ratio is £[S'(y, £)]/£[#'(£)]. Thus, is a natural 
representation of the service level constraint (excluding the pathological case 
= 0) in period /. If we define G l (y,£) := S'(y,£) - / , > , (?) then 
we can conveniendy write the service level constraint as £[G'(y,f)] > 0. Define 
g l (y) := EfG'Cv,^)] as the expected service level in period i as a function of the 
server allocation vector y and let g:RP — Rp be a function whose /th component 
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We are now ready to formulate the problem of minimizing staffing costs subject to 
satisfying a minimum service level in each period. It is 

min c l x 

subject to Ax ^ y, 

g(y) > 0, (i) 
xeX, 

x, y ^ 0 and integer, 

where X is a compact set. The compactness of X can be easily justified in practice. It is, 
for example, impossible to hire an infinite number of employees, and there are usually 
budget constraints which impose an upper bound on x since c is generally positive. We 
also define, for future reference, 

Y := [y ^ 0 and integer: 3 0 ^ x e X and integer with Ax ^ y}. 

Note that Y is a finite set since X is compact and the entries in A are either 0 or 1. 

The functions g'(y) are expected values, and the underlying model might be so 
complex that an algebraic expression for g(y) can not be easily obtained. Therefore, 
simulation could be the only viable method for estimating g(y). In the next subsection 
we formulate (1) as an approximate problem, where the expected values are replaced by 
sample averages. 

2.2. Sample average approximation of the call center problem 

In this paper we assume that the algebraic form of the service level function g{y) 
is not available, and that its value is evaluated using simulation. Suppose we run 
a simulation with sample size «, where we independently generate the realizations 
{£ rf )3 =1 from the distribution of f , to get an estimate of the expected values g(y). Let 
g n (y) = (i/n) 532=1 G (y< toe resulting estimates and let g^iy) denote the ith 

component of g n (y). We use this notation to formulate the sample average approxima- 
tion 

min c T x 

subject to Ax ^ >\ 

gn(y)>0, (2) 
x e X, 

x,y ^ Oand integer. 

The problem above is linear except for the service level function g n (y). We assume that 
each of the component functions are concave so that we can approximate them 
with piecewise linear concave functions and solve the sample average approximation by 
using cutting plane methods. In the next subsection we discuss the concavity assumption 
in more detail. 
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2.3. Concave service levels 

Intuitively, we would expect that the service level increases if we increase the number of 
employees in any given period. We also conjecture that the marginal increase in service 
level decreases as we add more employees. If these speculations are true then g' (v) is 
increasing and concave in each component of y for all /. We will make the stronger 
assumption that g l (y) and £ (y) are increasing componentwise and jointly concave in y, 
for all i. Our initial computational results suggest that this is a reasonable assumption, 
at least within a region containing practical values of v (see section 6). Others have 
also studied the convexity of performance measures of queuing systems. Aksjn and 
Harker (2001) show that the throughput of a call center is stochastically increasing and 
directional concave in the sample path sense as a function of the allocation vector y in 
a similar setting. Analysis of the steady state waiting time of customers in an M/M/s 
queue shows that its expected value is a convex and decreasing function of the number 
of servers s (Dyer and Proll, 1977), its expected value is convex and increasing as a 
function of the arrival rate (Chen and Henderson, 2001) and its distribution function 
evaluated at any fixed value, is concave and decreasing as a function of the arrival rate 
(Chen and Henderson, 2001). See other references in Chen and Henderson (2001) for 
further studies in this direction. Koole and van der Sluis (2003) developed a local search 
algorithm for a call center staffing problem with a global service level constraint. When 
the service level constraint satisfies a property called multimodularity their algorithm is 
guaranteed to terminate with a global optimal solution. There are, however, examples 
where the service level constraint, as defined in this paper, is not multimodular even for 
nondecreasing and concave service level functions. 

If the concavity assumption holds, then we can approximate the service level func- 
tion with piecewise linear concave functions, which can be generated as described below. 
The following definition is useful 

DeBnition 1 (Rockafellar, 1970, p. 308). Let y k e W be given. If h :R' — R is 
a concave function and q(y k ) e R p is such that 

h(y) < h(y k ) + q(y k ) T (y - /) Vy e R p (3) 
then q(y k ) is a subgradient of h at y k . 

The term "supergradient" might be more appropriate since the hyperplane 
[Hy ) + q(y k ) T {y - y k )} lies above the function //, but we use "subgradient" to con- 
form with the literature. A concave function has at least one subgradient at every point 
(see theorem 3.5.2 in Bazaraa, Sherali, and Shetty (1993)). The notion of concavity and 
subgradients is defined for functions of continuous variables, but we are dealing with 
functions of integer variables. We say that such a function h is concave if no points of 
the form (x, h(x)) e R' +l (with x e Z p ) lie in the interior of the convex hull of the 
set l(y, /;(y)): y e Z»} c R*+». We replace R p with Z*> in definition 1 to define the 
subgradient of a function with integer domain. 
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Let q'tf) and #(/) be subgradients at y* of and |j, respectively. There are 
many potential methods one might consider to obtain the subgradients. Finite differ- 
ences using differences of length 1 appear reasonable since we are working with integer 
variables. There are, however, examples where that fails to produce a subgradient, even 
for a concave nondecreasing function. Still, we used finite differences in our numerical 
study and converged to an optimal solution of the sample average approximation. Gra- 
dients might also be obtained using infinitesimal perturbation analysis (IPA) (see, e.g., 
Glasserman (1991)). Before using IPA we would have to extend the service level func- 
tion to a differentiable function defined over a continuous domain, since IPA is applied 
in settings where the underlying function is differentiable. 

The subgradients are used to approximate the service level constraints. Let y* be 
a given server allocation vector, and suppose that £(y*) and #(/) are obtained via 
simulation. If our assumptions about concavity hold then by definition 1 we must have 
8n(y) < gn(y k ) + <?i(y*) r (y - y k ) for ail allocation vectors y, and all /. We want y to 
satisfy g n (y) > 0 and therefore it is necessary that 

+ (4) 

for all i . 

In the next section we show how to use the subgradients in a cutting plane algorithm 
to solve the sample average approximation (2). 



3. The cutting plane method 

In this section we present a cutting plane algorithm for solving the sample average ap- 
proximation (2). We select a fixed sample size at the beginning of the algorithm and use 
the same sample (common random numbers) in each iteration. This minimizes the effect 
of sampling in that we only work with one function g n instead of getting a new g n func- 
tion in each iteration, which could, for example, invalidate the concavity assumption. 

The typical cutting plane algorithm for (2) works as follows. We relax the nonlinear 
service level constraints to convert the call center staffing problem into a linear integer 
problem. We solve the linear integer problem and run a simulation with the staffing 
levels obtained from the solution. If the service levels meet the service level constraints 
as approximated by the sample average then we stop with an optimal solution to (2). If a 
service level constraint is violated then we add a linear constraint to the relaxed problem 
that eliminates the current solution but does not eliminate any feasible solutions to the 
sample average approximation. 

Our algorithm fits the framework of Kelley's cutting plane method (Kelley, Jr., 
1960). It differs from the traditional description of the algorithm only in that we use a 
stimulation to generate the cuts and evaluate the function values instead of having an 
algebraic form for the function and using analytically determined gradients to generate 
the cuts. Nevertheless, we include a proof of convergence of our cutting plane method, 
since its statement is specific to our algorithm and it makes the results clearer. 
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The relaxed problem for (2) that we solve in each iteration is 



min c T x 



subject to Ax ^ y. 

D k y > d k , (5) 

X 6 A\ 

x, y ^ Oand integer. 

We replaced the constraints g n (y) ^ 0 with linear constraints D k y > d k . The sub- 
script k indicates the iteration number in the cutting plane algorithm. The constraint set 
Dky ^ d k is initially empty but we add more constraints to it as the algorithm evolves. 

At iteration k we solve an instance of (5) to obtain the solution pair (**, y*). For 
the server allocation vector y* we run a simulation to calculate g n (/). If we find that the 
service level is unacceptable, i.e., if £ (y*) < 0 for some / , then we add the constraint (4) 
to the set D k y > d k , i.e., we add the component (y*) + £ (/) V to d k and the row 
vector q n (y ) to D k . We add a constraint for all periods i where the service level is un- 
acceptable. Otherwise, if the service level is acceptable in all periods then we terminate 
the algorithm with an optimal solution to the sample average approximation (2). 

Algorithm 1. 

Initialization. Generate n independent realizations from the distribution of £. Let 

k <- 1, D x and d { be empty. 
Step 1. Solve (5) and let (x k , y k ) be an optimal solution. 
Step la* Stop with an error if (5) was infeasible. 
Step 2. Run a simulation to obtain g n (y k ). 

Step 2a. Ug n (y k ) 0 then stop. Return y k ) as an optimal solution. 

Step 3. Compute, by simulation, <?;,(/) for all / for which gUy k ) < 0, and add the 

cuts (4) to D k and d k . 
Step 4. Let<4+i <4 and D* + , +- D k . Let k +- k + 1. Go to step 1. 

It is usually not necessary to store the n independent realizations referred to in the 
initialization phase. Instead, we only need to store a few numbers, called seeds, and reset 
the random number generators (streams) in the simulation with the seeds at the beginning 
ot each iteration. See Law and Kelton (2000) for more details on this approach to using 
common random numbers. To speed up the algorithm it is possible to start with D, and 
</, nonempty. Ingolfsson and Cabral (2002) developed, for example, lower bounds on v. 
They point out that if there is an infinite number of servers in all periods except period i 
and if y, is the minimum number of employees required in period i in this setting so that 
the service level in period i is acceptable, then y, ^ y, for all y satisfying *(v) > 0. We 
could select D, and d { to reflect such lower bounds. 

If the algorithm terminates in step la then the sample average approximation is 
infeasible. That could be due to either a sampling error, i.e., the sample average approx- 
imation does not have any feasible points even though the original problem is feasible, 
or that the original problem is infeasible. As a remedy, either the sample size should be 
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increased, or the original problem should be reformulated, e.g., the acceptable service 
level should be lowered, or more employees should be allocated (expand X). 

In the algorithm above we solve an integer linear program and add constraints to it 
in each iteration until we terminate. The integer linear problem always has a larger fea- 
sible region than the sample average approximation (2), so c T x k ^ c T x k + { ^ c T x\ 
where (x*,y*) is an optimal solution for (2). An important question is whether 
limjt-wco c T x k = c T x*. The following theorem answers this question in the positive. 

Theorem 1. 

1. The algorithm terminates in a finite number of iterations. 

2. Suppose that each component of g n is concave in y. Then the algorithm terminates 
with an optimal solution to (2) if and only if (2) has a feasible solution. 

Proo/ L Y is a finite set and it is therefore sufficient to show that no point in Y 
is visited more than once. Suppose that the algorithm did not terminate after visiting 
point y . That means that g n (/ ) ^ 0 and we added one or more cuts of the form 

o<*i(y) + ffi(y l ) r (y-y)- ie[\ p] 

to (5). Suppose that y* = y', for some k > /. Since y k is the solution for (5) at step k it 
must satisfy the cuts added at iteration /, i.e., 0 ^ si(y') + q i n (y t ) T (y k - y') = g* (y'), 
which is a contradiction because this constraint was added since g^iy') < 0. Hence, 
we visit a new point in the set Y in each iteration and thus the algorithm terminates in a 
finite number of iterations. 

2. Suppose first that (2) does not have a feasible solution. Then no y e Y satisfies 
In 00 ^ 0. The algorithm only visits points in K, so the optimality condition in step 2a 
is never satisfied. Since the algorithm terminates in a finite number of iterations it must 
terminate with the relaxed problem being infeasible. Suppose now that (2) is feasible. 
The problem (5) solved in step 1 is a relaxed version of (2) since g n is concave, so (5) 
is feasible in every iteration. Therefore, the algorithm terminates in step 2a with (jc*, y k ) 
as the solution. But g n (y k ) ^ 0 by the termination criteria, so it is an optimal solution 
to (2). D 

The theorem above states that we terminate with an optimal solution to the sample 
average approximation so long as one exists. In the next section, we discuss the conver- 
gence of that solution to an optimal solution to the original problem (1) as the sample 
size n increases. 

4. Convergence of solutions of the sample average approximation to solutions of 
the original problem 

We have established that the cutting plane algorithm will identify an optimal solution of 
the problem (2). The problem (2) was fonmed by approximating the expected service 
level constraints of problem (1), and we will next investigate if solutions of the sample 
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average approximation converge to a solution of the original problem. We show, by 
using the strong law of large numbers (SLLN), lhat the set of optimal solutions of the 
sample average approximation is a subset of the set of optimal solutions for the original 
problem w.p. 1 as the sample size gets large. Furthermore, we show that the probability 
of this event approaches 1 exponentially fast when we increase the sample size. These 
results require the existence of at least one optimal solution for the original problem 
to satisfy the expected service level constraints with strict inequality, but this regularity 
condition can be easily justified for practical purposes as will be discussed later. 

4.1. Almost sure convergence of optimal solutions of the sample average 
approximation 

The results in this section may be established by specializing the results in Vogel (1994). 
We choose to provide direct proofs in this section for 3 main reasons: 

1. The additional structure in our setting allows a clearer statement and proof of the 
results. 

2. The proofs add important insight into why solving the sample average approximation 
is a sensible approach. 

3. The proofs serve as an excellent foundation to develop an understanding of the "rate 
of convergence" results that follow in section 4.2. 

The effect of the sampling on the optimization problem is to change the shape of 
the feasible region. It directly affects the service level constraint, so we will rewrite the 
problems (1) and (2) to make the effect more transparent and to make the proofs easier 
to read. First define 

f(y) : = _ min c T x, 

U^O and integer x€X, Ax^y] 

where f(y) = +oo if the set {jc ^ 0 and integer: x e X, Ax ^ y} is empty. Now 
we can rewrite problem (1) as 

min f(y) 

subject to y e K, ($) 

8(y) >0 

and its sample average approximation, which is equivalent to (2), as 

min f(y) 

subject to v € K, (7) 
g n {y) >0. 

We are interested in the properties of the optimal solutions of (7) as the sample size 
n gets large. It turns out, by an application of the SLLN, that any optimal solution of 
(6) that satisfies g(y) > 0, i.e., g*( y ) > 0 for all /, is an optimal solution of (7) with 
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probability 1 (w.p. i) as n goes to infinity. We make a few more definitions before we 
prove this. Let 

ioo(v) := Um g n (y), 

n—»co 

:= the optimal value of (6) 

and define the sets 

Y * :— the set of optimal solutions to (6). 
Y*:= [yer:g(y)>Ol 
Y x :={yeY: f(y)<F\ g(y) ? 0}, 
Y* := the set of optimal solutions to (7). 

Note that 7, is the set of solutions to (6) that have the same or lower cost than an optimal 
solution, and satisfy all constraints except the service level constraints. We are concerned 
with solutions in this set since they could be feasible (optimal) to the sample average 
approximation (7) if the difference between the sample average, g n , and g is sufficiendy 
large. We show that when Y* is not empty, Y+ c Y* C y* for all n large enough 
w.p. 1. We say that property E(n) holds for all n large enough w.p. 1 if and only if 
P[3N < oo: E(n) holds V/i ^ N] = 1. (Here N should be viewed as a random 
variable.) Sometimes such statements are communicated by saying that E(n) holds 
eventually. 

We start with two lemmas. The first one establishes properties of g^iy) by repeat- 
edly applying the SLLN. The second shows that solutions to (6) satisfying g(y) > 0, 
and infeasible solutions, will be feasible and infeasible, respectively, w.p. 1 for problem 
(7) when n gets large. The only condition g(y) has to satisfy is that it has to be finite for 
all y € Y. That assumption is easily justified by noting that the absolute value of each 
component of g(y) is bounded by the expected number of arrivals in that period, which 
would invariably be finite in practice. 

Even though we restrict attention to optimal solutions, the overall approach would 
not change if we wanted to prove that all "^61™^ feasible solutions for (6) are eventu- 
ally feasible for (7) w.p. 1 and that all infeasible solutions for (6) are eventually infeasible 
for (7). This may lend some intuition, since it will then almost invariably be the case that 
the feasible region of the sample average approximation converges to the feasible region 
of the original problem and therefore the set of optimal solutions converges. Define 

11*11 = max|||Ky)|L = ^^J^^l 

Lemma 2. 

1. Suppose that HsCvJHco < oo for some fixed y e Z p , y ^ 0. Then g^(y) = g(v) 
w.p. 1. 

2. Suppose that ||^|| < oc and f c y. Then g^(y) = g(y) Vy e f w.p. 1. 
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Proof. 1. The SLLN (see theorem 6.1 in Billingsley (1995)) gives g^iy) = g'(y) 
w.p. 1. So 

p 

P[g<»(y) = g(y)] p[i^(v) # ,'(,)] = i. 

2. Note that 

/^(y) = *(y) Vy e r] ^ 1 - ^ /^(y) ^ g(y) ] = i 

since T is finite. g 

Lemma 3. Suppose that ||^|| < oo. Then 

1- g n (y) ^ 0 Vy e r 0 * for alln large enough w.p. 1. 

2. All y e 7i are infeasibie for the sample average approximation (7) for all n lame 
enough w.p. 1. 

Proof. 1. The result is trivial if K 0 * is empty, so suppose it is not. Let 

e — min min {^(v)). 

Then € > 0 by the definition of K 0 *. Let 

JV 0 = inf(/i 0 : - SOOIloo < € VO*o|, 

with the infimum defined as +oo if the set is empty, and then g n > 0 Vy e y Q * V/z ^ N 0 
Now, the set K 0 * is a subset of K, so lirrw^Cy) = giy) Vy e YJ w.p. 1 by part 2 of 
lemma 2. Therefore, N 0 < oo w.p. 1. 

2. The result is trivial if Y\ is empty, so suppose it is not. Let 
€ = min max I— £'(y)}. 

Then <r > 0, since g i (y) < 0, for at least one / € {1 p] Vy € Y { . Let 

= inf{n,: ||s(y) - ^(y)^ < € Vn > *,} 
and then all y e K, are infeasibie for (7) for all n S> N { . Now, the set Y x is a subset of 7, 
so linWco g n (y) = g(y) Vy e Y { w.p. 1 by part 2 of lemma 2. Therefore, M < oo 
w.p. 1. □ 

Lemma 3 shows that all the "interior" optimal solutions for the original problem 
are eventually feasible for the sample average approximation and remain so as the sam- 
ple size increases. Furthermore, all solutions that satisfy the constraints that are common 
for both problems, but not the service level constraints, and have at most the same cost 
as an optimal solution, eventually become infeasibie for the sample average approxi- 
mation. Hence, we have the important result that for a large enough sample size an 
optimal solution for the sample average approximation is indeed optimal for the original 
problem. 
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Theorem 4 Suppose that \\g\\ < oo. Then Y* c y* for all n large enough w.p. 1. 
Furthermore, if y 0 * is nonempty then Y* C Y* c y* for all n large enough w.p. 1. 

Proof. The first inclusion holds trivially if K 0 * is empty, so assume that K 0 * is not 
empty. On each sample path let N = sup{Af 0 , A/,}, where N 0 and N } are the same 
as in lemma 3. When n 5* N we know that all y € F 0 * are feasible for (7) and that all 
y e Yi are infeasible for (7). Hence, all y e K 0 * are optimal for (7) and no y £ Y* 
is optimal for (7) whenever n > N. Thus, Y* C r„* c Y* for aU n ^ N. Finally, 
P[N < oo] = P[# 0 < oo, N { < oo] ^ P[N 0 < oo] + < oo] - 1 = 1. □ 

Corollary 5. Suppose that < oo and that (1) has a unique optimal solution, y*, such 
that g(y*) > 0. Then y* is the unique optimal solution for (2) for all n large enough 
w.p. 1. 

Proof In this case 7 0 * = Y* = [y*} and the result follows from the previous theo- 
rem. □ 

The conclusion of theorem 4 relies on existence of an "interior" optimal solution 
for the original problem. A simple example illustrates how the conclusion can fail if 
this requirement is not satisfied. Let £ be a uniform random variable on [-0.5, 0.5] and 
consider the following problem: 

min y 

subject to y>|£[|]|, 

y ^ 0 and integer. 

Then y* =0 for this problem since £[f ] = 0. We form the sample average approxima- 
tion by replacing E[%] with £„, the sample average of n independent realizations of £. 
Then 0.5 > |f n | > 0 w.p. 1 for all n > 0 and thus we get that y* = 1 w.p. 1. 

We mentioned earlier that the existence of an "interior" optimal solution is merely 
a regularity condition. In reality it is basically impossible to satisfy the service level con- 
straints in any period exactly, since the feasible region is discrete. Even if this occurred, 
we could subtract an arbitrarily small positive number, say e , from the right-hand side of 
each service level constraint and solve the resulting e -perturbed problem. Then all solu- 
tions with g l (y) = 0 for some i satisfy g' (y) > -e and it is sufficient for the problem 
to have an optimal solution (not necessarily satisfying g(y) > 0) for theorem 4 to hold. 
This rationale also applies to die next subsection where we prove an exponential rate of 
convergence as the sample size increases. 

42. Exponential rate of convergence of optimal solutions of the sampled problems 

In the previous subsection, we showed that we can expect to get an optimal solution for 
the original problem (1) by solving the sample average approximation (2) if we choose 
a large sample size. In this section we show that the probability of getting an optimal 
solution this way approaches 1 exponentially fast as we increase the sample size. We use 
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large deviations theory and a result due to Dai, Chen, and Birge (2000) to prove our state- 
ment. Vogel (1988) shows, under weaker conditions, that the feasible region of a sample 
average approximation for a chance constraint problem approaches the true feasible re- 
gion at a polynomial rate and conjectures, without giving a proof, that an exponential 
rate of convergence is attainable under similar conditions to those we impose. 

The following theorem is an intermediate result from theorem 3.1 in Dai, Chen, 
and Birge (2000). 

Theorem 6. Let H:R P x Z — ► R and assume that there exist y > 0, #o > 0 and 
t] : Z R such that 

|W(y,f)|<Kif«), 4e*»*>] <oo, 

for all y e R p and for all 0 ^ 0 < 0 O > where £ is an integer valued random variable. 
Then for any S > 0, there are a > 0, b > 0, such that for any y e R p 

P[\h(y)-h n (y)\>$]^ae- b \ 

for all n > 0, where h(y) = E[H(y, £)], and h n (y) is a sample mean of n independent 
and identically distributed realizations of H{y, §). 

In our setting take H(y y f ) = G'fr, £) and note that |G'(y, £)| ^ N* (£), where N* 
is the number of calls received in period i. If the arrival process is, for example, a (non- 
homogeneous or homogeneous) Poisson process, which is commonly used to model 
incoming calls at a call center, then N' satisfies the condition of theorem 6 since it is a 
Poisson random variable* which has a finite moment generating function. 

Before we prove the exponential rate we prove a lemma mat shows that for any n, 
*o* ^ Y n ^ precisely when all the solutions in K 0 * are feasible for the sample average 
approximation, and all infeasible solutions for (6) that are equally good or better, i.e., 
are in the set Y\ , are also infeasible for (7). 

Lemma 7. Let n > 0 be an arbitrary integer. The properties 

1. g n (y)>0VyeY 0 \xid 

2. g n (y)2 0VyeY { 

hold if and only if K n * c Y* c y*. 

Proof. Suppose properties 1 and 2 hold. Then by property 1 all y e Y£ are feasible 
for (7) and the optimal value of (7) is at most F\ By property 2 there are no solutions 
with a lower objective that are feasible for (7), so Y^ c Y*. By property 2, no solutions 
outside Y* with objective value equal to F* are feasible for (7). Hence, Y* c Y* c Y*. 

Suppose 7,; c y; c y*. Then F* is the optimal value for (7). Now, since all 
y e Y£ are optimal for (7) they are also feasible for (7) and property 1 holds. All v 6 Y x 
are infeasible for (7) since Y* C K* and therefore property 2 holds. □ 
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Theorem 8. Suppose G l (y,£) satisfies the assumptions of theorem 6 for all i e 
{1, p) and that y fl * is nonempty. Then there exist a > 0, P > 0 such that 

Pwo/ Define 

5i:=min min {g'OO}, 
i(y):=arg max {-^(y)}. 

•€{l *r ' 

5 2 :=min{--g ,<y) ()0}, and 

^:=min{5i,5 2 ). 

Here £j > 0 is the minimal amount of slack in the constraints KK g(y) ^ 0" for any 
solution v e K 0 * . Similarly 8 2 > 0 is the minimal violation in the constraints "g(y) ^ 0" 
induced by any solution y e Y\ . Thus, 

p[y£ c r; c r] 

= P[f -00 5* 0 Vy e K.\ g m (y) ? 0 Vy <= K,] (8) 

= 1 - P[g n (y) 2 0 for some y e or g n (y) ^ 0 for some y e Y { ] 
p 

> 1 - E E p Kw < °] - E p [i-w > °] w 

-E'n^w-^wi^'] do) 

> l-ae^". 



Here 



a = l*o I E + E 2111(1 £ = min 6,, 

where | K 0 " | is the cardinality of the set K 0 *. Equation (8) follows by lemma 7 . Equation (9) 
is Boole's inequality. Equation (10) follows since P[g n (y) ^ 0] ^ P[gn y) (y) ^ 0] 
and g*(y) ^8\^& for >' € Y { \ and g i(y) (y) ^S 2 ^S for yeY { . Equation (11) follows 
from theorem 6. □ 
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The case where Y^ is empty but Y* is not would almost certainly never arise in 
practice. But in such a case one can solve an ^-perturbation of (2) as described at the 
end of section 4. 1, and the results of theorem 8 hold for 0 < e < S. 

5. Numerically checking the concavity of a function 

The success of the cutting plane algorithm relies on concavity of each component of 
the service level function g n . If a component of g n is not concave, then the algorithm 
may "cut off' a portion of the feasible set and terminate with a nonoptimal solution. 
In each iteration of the algorithm we obtain new information about g n . To improve the 
robustness of the algorithm, we would like to ensure that the information we receive is 
consistent with the notion that each component of g n is concave. 

There are 2 cases to consider. The first is where the vectors q^iy) as returned by 
the simulation are guaranteed to be subgradients of g* n if g* n is concave. For example, 
this would occur if the vectors were exact gradients of the function g' n at y (assuming 
that it had a differentiable extension to R p from Z p ). In this case mere is an easy test 
for nonconcavity. as we will see. The second case, that appears more likely to occur in 
practice, is where the vectors q l n (y) are obtained using some heuristic, and are therefore 
not guaranteed to be subgradients, even if g' n is indeed concave. In this case, we may 
decide to disregard some of the potentially-unreliable "subgradient" information and 
focus only on the function values themselves. (This setting may also be useful if one does 
not have "subgradient" information at all points, as arises using the finite-differencing 
heuristic mentioned earlier. When evaluating the "subgradient" at y, we also compute the 
function value, but not gradient information, at points of the form y + e t where e, is the 
usual /th basis vector.) If the function values themselves are consistent with the notion 
that the function is concave, then we may view our heuristically-derived "subgradients" 
with some suspicion, and even drop some of them from the optimization. An alternative 
would be to attempt to restrict the feasible region to a region where the functions are 
concave. We view the analysis of the cutting plane algorithm under these conditions as 
beyond the scope of mis paper, partly because it is then possible mat we then need to 
deal with the usual difficulties of nonconvex optimization. If the function values alone 
suggest nonconcavity, then the algorithm results should be viewed with some caution. 
Indeed, values reported as optimal by the algorithm could, in this case, be nonoptimal. 
The ability to detect when the key assumption of the cutting plane algorithm may not 
apply is, we believe, a strength of our approach. 

Of course, one may either implement a check for nonconcavity either inline on 
each iteration of the cutting plane algorithm, or after the algorithm halts, or not at all. 
The choice depends on how conservative one wishes to be, and is therefore application 
dependent, and so we do not enter into a discussion of which approach to take here. 

To simplify the presentation, let us consider die concavity of a real-valued 
function f :R P R instead of Hopefully no confusion will arise since the 
previously-defined function / plays no role in mis section. We assume that we are 
given a set of points y 1 , v 2 , . . . , y* 6 R p and their corresponding function values 
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f(y l ), f(y 2 )y • • , /(/)- The tests below allow one to conclude that either / is non- 
concave, or that there exists a concave function that matches the given function values. 
Of course, the tests cannot conclude that / is concave unless they examine all points 
in its domain, so that the conclusions that these tests reach are the best possible in that 
sense. 

5.Y. Concavity check with Junction values and "subgradients" 

Suppose that we know the vectors q(y l ), q(y 2 ), . . . , q(/) in addition to the set of points 
and their function values. Here q(y v ) should have the property that // / is concave, 
then q(y v ) is a subgradient at y° (v = 1, . . . , k). If they are in fact subgradients then 
they need to satisfy (3), i.e., all k points must he below the k hyperplanes defined by 
the q (y v Ys and the corresponding function values. This means that for each point y, 
v e {1, ...,*}, we must check that 

f(y w ) ^ f(y v ) + q(f) T (y w -y v ) vu,e{i *}. (i2) 

If this inequality is violated by some v and w, then we conclude that / is not concave 
in y. Otherwise, the known values of / do not contradict the concavity assumption and 

is a concave function (see theorem 5.5 in Rockafellar (1970)), such that h(y w ) = f(y w ) 
Vu; € {1 k). In other words if (12) holds Vv e [1, . . . , k] then a concave func- 
tion exists that agrees with the observed function values and "subgradients" q{y°). 
v= 1,...,*. 

When this test is implemented in the framework of algorithm 1, where in each 
iteration k we obtain f^y** 1 ) and ?;(y* +1 ), we need only check that (for each 
period 0 the new point lies below all the previously defined hyperplanes and that all 
previous points lie below the hyperplane defined by the new "subgradient." 

5.2. Concavity check with function values only 

Now consider the case when only / is known at a finite number of points. 

We want to know whether or not there is a concave function, say A, which passes 
through / at all the given points. If such a function does not exist then we conclude that 
/ is not concave. (This problem appeared in Murty (1988, p. 539).) 

We present a method where we solve a linear program (LP) and draw our con- 
clusions based on the results of the LP. The idea behind this method is that if a one- 
dimensional function is concave then it is possible to set a ruler above each point and 
rotate it until the function lies completely below the ruler. This can also be done when 
dealing with functions of higher dimensions, and then the ruler takes the form of a plane 
(p — 2) or a hyperplane (p > 2). 

The LP changes the given function values so that a supporting hyperplane for the 
convex hull of the points can be fitted through each point. The objective of this LP is to 
minimize the change in the function values that needs to be made to accomplish this goal. 
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If the changes are measured in the L { - or L«,-norm then the objective function is linear. 
The LP also gives an idea of how far, in some sense, the function is from being concave 
if a concave function cannot be fitted through the given points. If a concave function can 
be fitted then the LP will return such a function, namely the pointwise minimum of the 
hypeiplanes computed by the LP. 

It is most straightforward to use the L, -norm to measure the changes in the function 
values. Then the LP can be formulated as follows: 
k 

min 

subject to (13) 
<*ov + (a v ) V = /(/) +b v Vw € (1, . . . , *}, 

aov + {a v ) T y w > f{y w )+b w Vv e {1, . . . , k) Vu; 6 {1 *}, w # v. 

To linearize the objective function we adopt the standard trick of writing b v = b+ - b~ 
and replace \b v \ with b+ + where £?+ and b~ are nonnegative. The decision variables 
are: 

oq v 6 R, v 6 {1 , . > . , k}: intercepts of the hyperplanes, 
a v e R', v e { 1 , . . . , k): slopes of the hyperplanes and 
b+, b~ 6 R, v g (1, k}: change in the function values. 

The number of variables in this LP is k(p + 1) + 2k = k(p + 3) and the number 
of constraints is k + k(k - I) = k 2 . We could split the LP up into k separate linear 
programs if that would speed up the computations, as might occur if we could run them 
on multiple processors in parallel, or if the LP solver was unable to detect the separable 
structure in this problem and exploit it. Here, the uth separate linear program tries to fit 
a hyperplane through the point (y y , / (y u )) that lies above all other points. 

The LP is always feasible, since a feasible solution is given by a v = 0, a 0v = 0 and 
b v = -f(y°) for all v e (1, ... , k}. It is also bounded below by 0, since the objective 
function is a sum of absolute values. Therefore, this problem has a finite minimum. If 
the minimum value is 0, then the function defined by 

h(y) := inf ^ v + (a v ) T y 

v—l k v 7 

is concave and f(y v ) = h(y v ) for all v e {1, . . k}. On the other hand, if / is in- 
deed concave, then there exists a subgradient at every point of / (see theorem 3.2.5 in 
Bazaraa, Sherali, and Shetty (1993)) and hence the constraints of the LP can be satisfied 
with b v = 0 for all v e {1, . . . , *}. We have proved the following result. 

Theorems Consider the LP (13). 

1. If the optimal objective value of the LP is 0 then there exisLs a concave function h(y) 
such that h ( y v ) = f(y ") for all v e [ 1 , . . . , *}. 

2. If / is concave then the optimal objective value of the LP is 0. 
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So we see that a necessary condition for / to be concave is that the optimal objec- 
tive value of the LP ( 1 3) is zero. Thus we have the following corollary. 

Corollary 10. If the optimal objective value of the LP (13) is positive, then / is not 
concave. 

Note that the hyperplanes obtained from the LP are generally not subgradients 
of /, so we cannot use them in algorithm I as such. Hence, we have to solve this LP 
after step 2 in each iteration, or as a check after the algorithm terminates. Given the 
computational demands of the cutting plane algorithm, repeatedly solving this LP in 
each iteration does not represent a significant increase in computational effort. 

6. Computational study 

In this section we present a small numerical example that showcases the properties of 
our method. The example is far from being a realistic representation of a call center, but 
captures many issues in setting call center staffing levels. We will study 3 aspects of the 
problem in the context of the example; 

1. Convergence of the cutting plane algorithm and the quality of the resulting solution. 

2. Dependence of the service level in one period on staffing levels in other periods. 
This is of particular practical interest since traditional methods assume independence 
between periods. 

3. Concavity of g n (y). 

Our implementation creates the integer programs (5) in AMPL and uses the 
CPLEX solver to solve them in step I of the algorithm, and a simulation model built 
in ProModel to perform steps 2 and 3. We used Microsoft Excel to pass data between 
the simulation and optimization components and to run the iterations of the algorithm. 
The implementation was exactly as described in algorithm 1 except for the initialization, 
where we started with y 1 at the lower bounds described in section 3 instead of starting 
with Di and d { empty. 

6.1. Example 

We consider an M (t)/M/s(t) queue with p = 5 periods of equal length of 30 minutes. 
We let the service rate be p = 4 customers/hour. The arrival process is a nonhomoge- 
neous Poisson process with the arrival rate a function of the time / in minutes equal to 
k(t) = A.(l - |//150 - 0.65|), i.e., the arrival rate is relatively low at the beginning of 
the first period, then increases linearly at rate X until it peaks partway through the fourth 
period and decreases at rate k after that. We set A. = 120 customers/hour, which makes 
the average arrival rate over the 5 periods equal 87.3 customers/hour. 
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Tabte 1 

The iterates of the algorithm and the resulting service level function values and their 95% confidence inter- 
vals (CI). f(y k ) is the objective value at y*. 



g m {y k ) ± 95% CI half width 
v* (% of calls received that are answered in less than 90 sec.) f(y k ) 



I 


11 19 


27 


30 


29 


0.4 ±1.0 


-1.1 ±2.2 


-1.8±3.1 


-3.5 ±3.2 


-2.1 ±2.3 


125.0 












(81.5%) 


(77.2%) 


(76.6%) 


(73.8%) 


(75.5%) 




2 


11 21 


27 


33 


29 


0.5 ± 0.9 


3.4 ±1.5 


0.2 ± 2.6 


4.1 ±2.2 


-0.3 ± 2.7 


127.5 












(81.9%) 


(88.5%) 


(80.5%) 


(87.4%) 


(79.4%) 




3 


11 21 


27 


34 


29 


0.5 ±0.9 


3.4±1.5 


0.4 ± 2.6 


5.8 ± 1.8 


0.0 ±2.6 


128.0 












(81.9%) 


(88.5%) 


(80.7%) 


(90.4%) 


(80.0%) 





The goal is to answer 80% of received calls in each period in less than 90 seconds. 
The customers form a single queue and are served on a first come first serve basis. If 
a server is still in service at the end of a period it finishes that service before becoming 
unavailable. For example, if there are 8 busy servers at the end of period 3 and period 4 
only has 6 servers then the 8 servers will continue to serve the customers already in 
service, but the next customer in the queue will not get service until 3 customers have 
finished service. 

There are 6 permissible tours, including 4 tours that cover 2 adjacent periods, i.e., 
periods 1 and 2, 2 and 3, 3 and 4, and finally 4 and 5. The remaining 2 tours cover only 
one period, namely the first and the last. The cost of the tours covering 2 periods is $2 
and the single period tours cost $1.50. 

6.2. Results 

We selected a sample size of n = 100 for running the algorithm. The lower bounds on 
y are depicted in the row k = 1 in table 1. Note that the staffing levels at the lower 
bounds result in an unacceptable level of service and thus a method which would treat 
the periods independently, would give an infeasible solution, since the service level is 
as low as 73.8% in period 4. The algorithm terminates after only 3 iterations with an 
optimal solution to the sample average approximation. To verify that this is indeed an 
optimal solution we ran a simulation for all staffing levels that have lower costs than 
the optimal solution and satisfy the initial lower bounds. None of these staffing levels 
satisfied gwo(y) ^ 0, so the solution returned by the algorithm is the optimal solution 
for the sample average approximation. By including the 95% confidence interval we get 
information about the quality of the solution as a solution of the original problem. In the 
example, the confidence intervals in periods 1, 3 and 5 cover zero, which is a concern 
since we cannot say with conviction that our service level is acceptable in those periods. 
To get a better idea of whether the current solution is feasible for the original problem 
we calculated £<m(y 3 ) = (0.5 ± 0.3, 3.0 ± 0.5, 2.3 ± 0.7, 5.1 ± 0.7, 0.0 ± 0.8) r , so we 
are more confident that the service levels in periods 1 and 3 are acceptable. The service 
level in period 5 is close to being on the boundary, hence our difficulty in determining 
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8 9 10 11 12 13 14 13 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 

y 




100% 
93% 
90% 
83% 
80% 
A 73% 
7*70% 
65% 
60% 
55% 



50% 



-*-y3-23 
♦y3-27 
"*-y>29 



25 26 27 28 29 30 4 31 32 33 M 35 36 37 

y 

Figure 1 . Dependence of staffing levels on the service level in period 3 of the example in section 6.1 . 

whether the solution is feasible or not. From a practical standpoint, if we are infeasible, 
then we are so close to being feasible that it probably is of little consequence. 

We already noted that there is dependence between periods. To investigate the 
dependence further we calculated f\ (>'), the percentage of calls received in period 3 
answered in less than 90 seconds, i.e., f\(y) := Y!d=\ S 3 C V < LLi ^ 3 (>'. We 
chose period 3 to demonstrate how the service level depends on staffing level in both the 
period before and after. Figure 1 illustrates this point. The graphs show the service levei 
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Table 2 

Concavity study. Low, medium and high staffing levels in each period and the 
optimal values of (13). 



Period 




2 


3 


4 


5 


Low 


10 


18 


26 


29 


29 


Medium 


12 


20 


28 


31 


31 


High 


14 


22 


30 


33 


33 


Optimal value 


0.0 


0.0 


4.0 • 10 -3 


1.5 - KT 2 


5.7 • 10~ 4 



in period 3 as a function of the staffing level in period 3 (l.a), period 2 (l.b) and period 4 
(l.c) when the staffing levels in other periods are fixed. The service level depends more 
on the staffing level in the period before than the period after as could be expected. That 
is because a low staffing level in an earlier period results in a queue buildup, which 
increases waiting in the next period. The reason why the staffing level in a later period 
affects the service level in an earlier period is that customers that called in the earlier 
period may still be waiting at the beginning of the next period and thus receive service 
earlier if there are more servers in that period. We noted dependence between periods 
as far apart as from the first period to the last. Figure 1 also supports the concavity 
assumption of the service level function when y is within a region of reasonable values, 
i.e., at least satisfies some lower bounds. It is, however, clear that the service level 
function looks like an s-shaped function over a broader range of y's as pointed out by 
Ingolfsson and Cabral (2002). That would not be problematic if one were to include the 
aforementioned lower bounds on y and if the concavity assumption holds for all v above 
the lower bounds. 

We also performed a separate concavity check based on the method in section 5.2. 
In an effort to demonstrate these ideas as clearly as possible we performed the concav- 
ity check outside the scope of the cutting plane algorithm itself, using a selection of 
points that appear reasonable from a performance standpoint. We used a sample size of 
300 and calculated ihoo(y) at 3 different staffing levels (labelled low, medium and high 
in table 2) for each period, i.e., at 3 5 = 243 points. We solved the linear program (13) 
for each g\ m (y), » e {1, . . . , 5}, and obtained the results in table 2. We see that the 
service level functions in periods 1 and do not violate the concavity assumption at 
the observed points. The other functions violate the concavity condition. The values 
of the b v \ the changes needed to satisfy the concavity assumption are all small, 
as can be seen by the objective value. We examined the points at which nonconcavity 
was detected, and noted that they occurred when a change in staffing level in a different 
period was made. (It is a strength of the LP-based concavity check that we were able to 
discover a region where the nonconcavity was exhibited.) The service level in period 3 
increased, for example, more when the staffing level in period 1 was increased from 12 
to 14 than when it was increased from 10 to 12 at staffing levels 22 and 30 in periods 2 
and 3, respectively. The reason for this violation of the concavity assumption is not 
obvious. 
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One possible explanation is that our measure of service quality is binary for each 
customer, so that "rounding" may contribute to the nonconcavity. To elaborate, in the 
above example it is possible that unusually many customers exceed the waiting time limit 
of 90 seconds by very little when there are 12 servers in period I, so that the effect of 
adding servers at this staffing level is more than when servers are added at a lower level. 
We would expect such a "rounding" effect to be averaged out in a longer simulation. 
In fact, we increased the sample size to 999 (the maximum number of replications in 
ProModel 4.2) and calculated the service level at the problematic points. We discovered 
that the nonconcavity vanished. Therefore, we make the following conjecture. 

Conjecture 1L For M(t)/M/s(t) queues of the form considered here there exists a 
finite y 0 ^ 0 such that the service level function g is nondecreasing and concave in y 
in the region y ^ y 0 . Furthermore, g n is nondecreasing and concave in y in the region 
yo ^ y ^ yi for all n large enough w.p. 1, for any fixed y x ^ y 0 . 

7. Conclusions 

In this paper we have shown that combining simulation and cutting plane methods is a 
promising approach for solving optimization problems in which some of the constraints 
can only be assessed through simulation. As a motivating example we studied the prob- 
lem of niinimizing staffing costs in call centers when traditional methods fail, either 
because of the characteristics of the problem, or if a detailed model of the call center 
dynamics are desired. We performed a computational study, which supports the use of 
the cutting plane method and demonstrates how it can be implemented. 

There are several interesting directions for future research. We established the 
theoretical foundation of the method, but an obvious drawback of the method is the large 
computational effort required to solve realistic problems. More research (in progress) is 
needed to make this a practical method. In relation to the integer programs one should 
investigate integer programming algorithms that can utilize the special structure of the 
relaxed problems solved in each iteration and consider allowing approximate solutions 
of the IPs, especially in early stages of the algorithm. One might consider Lagrangian 
relaxation techniques for solving these problems, still in the context of simulation and 
optimization, since we are approximating the constraints. 

Other areas of interest include coming up with methods for obtaining subgradients 
or improving the current heuristic of using finite differences. It would add to the robust- 
ness of the method to study the properties of the solutions when some of the conditions 
set forth in this paper, e.g., the concavity of the service levels, are violated. 

We only tested our algorithm on one simple example. It would be informative to 
run the algorithm on more complicated problems and include in the simulation model 
factors such as absenteeism and skill-based routing, not to mention implementing the 
algorithm in other types of service systems than call center staffing. We are currently 
pursuing many of these issues. 
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1. Introduction 

We consider a telephone call center, or a more general 
contact center, where different types of calls arrive at 
random and different groups of agents answer these 
calls. Each type of call requires a specific skill and 
each agent group (also called skill group) has a given 
subset of these skills, so all agents in that group can 
handle the corresponding call types and only those. 
There may be preferences among these agent groups 
for a given type of call, either because some groups 
are better than others at handling it, or because we 
prefer to save certain agent groups for other call 
types. 

The calls arrive according to arbitrary stochastic 
processes that could be nonstationary, and perhaps 
doubly stochastic (see, e.g., Avramidis et al. 2004). An 
arriving call can be served immediately if an agent 
with the appropriate skill is available, or may have to 
wait in a queue. An abandonment occurs whenever the 
waiting time of a call exceeds its (random) patience 
time. That call is then lost. Skill-based routing (SBR) 
strategies are used to determine which agent group 
handles each individual call. 

The goal is to minimize the operating cost of the 
center under a set of constraints on the quality of ser- 
vice (QoS). The decisions to be made are how many 
agents of each skill group to have in the center as 
a function of time. In a staffing problem, the day is 
divided into periods (e.g., 30 minutes or one hour 
each), and one simply decides the number of agents 



of each group for each period. In a scheduling problem, 
a set of admissible work schedules is first specified, 
and the decision variables are the number of agents 
of each skill group in each work schedule. This deter- 
mines the staffing indirectly, while making sure that it 
corresponds to a feasible set of work schedules. A yet 
more restrictive version of the problem is when there , " 
is a fixed set of available agents to be scheduled for 
the day or the week, where each agent has a specific 
set of skills. Then, we have a scheduling and postering 
problem. 

In this paper, the objective function is the sum of 
costs of all agents, where the cost of an agent depends 
on its set of skills. This cost is typically modeled as a 
constant plus an increment of between 5% to 20% for 
each additional skill. The QoS constraints are that the 
fraction of calls answered within a certain time limit, 
in the long run, exceeds a given threshold. This frac- 
tion is called the service leoel. In general, we may also 
have an upper bound on the abandonment ratio (the 
fraction of calls that are lost), or constraints on other 
types of performance measures. Such constraints can 
be imposed per call type, per period, and globally, 
with different thresholds. In this paper, we consider 
only service-level constraints. 

If each agent group has a single skill, we have a 
system of separate parallel queues, one queue per call 
type. If there is a single-agent group with all skills, 
we have a single queue and (other things being equal) 
smaller waiting times than with separate queues. 
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However, agents with more skills arc more costly, so 
there is a compromise to be made in choosing the skill 
groups. Wallace and Whitt (2005) have shown empir- 
ically, in a certain setting and for properly balanced 
systems with good routing strategies, that one or two 
skills per agent often gives a performance (in terms of 
QoS) that is almost as good as for a system where all 
agents have all skills. Note that pooling servers may 
also degrade QoS in certain settings, with poor routing 
strategies; sec, e.g., Mandelbaum and Reimann (1998). 

In a more general formulation, we would also want 
to optimize the routing of the calls. In static routing, 
each call type may have an ordered list of agent groups 
to try. If all are busy, the call joins a queue. There 
could be one queue per agent group, or one queue 
per call type (this is usually better because it allows 
more resource sharing), or one single queue for several 
call types, or a mixture of these. Priorities and com- 
plex routing rules can thus be implemented. For exam- 
ple, each agent may have an ordered list of queues to 
dig from when it becomes available, and the order is 
not necessarily the same for all agents in a group. The 
priorities may change based on thresholds on queue 
sizes or waiting times, etc. In a general dynamic routing 
scheme, the decisions may depend on the entire state 
of the system, which may include the current time, 
the number of calls of each type in service and in the 
queues, the elapsed service time of the calls in service, 
etc. Optimal dynamic schemes are usually too com- 
plicated and difficult to implement, so in practice the 
routing strategies are selected from a prespecified class 
of simpler rules. In the optimization problems studied 
in this paper, they are assumed to be fixed and we opti- 
mize only the staffing or scheduling. This was moti- 
vated by a specific request from a Canadian firm who 
wanted precisely this kind of optimization tool for the 
management of their call centers. 

Figure 1 gives a schematic view of arrival, routing, 
and service in a general multiskill call center with K 
call types and / agent groups. The A*s represent the 
arrival rates for the different call types, the servers S, 
represent the skill groups, and the mean service times 
l//jtjt fl may depend jointly on the call type and skill 
group. 

For general background on call center manage- 
ment, staffing and scheduling, and multiskill centers 
with SBR, we refer the reader to Gans et al. (2003), 
Ingolfsson et al. (2003), Koole and Mandelbaum 
(2002), and Wallace and Whitt (2005). 

Atlason et al. (2004) have proposed a general meth- 
odology, based on the cutting-plane method of Kelley 
Jr. (1960), to optimize the scheduling of agents in 
a single-call-type and single-skill call center, under 
service-level constraints. Their method combines sim- 
ulation with integer programming and cut generation. 



Figure 1 A General Multiskill Call Center 
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The general idea is to optimize a relaxation of a sam- 
ple average version of the problem (which becomes 
a deterministic problem) by generating cuts from the 
violated service-level constraints and adding corre- 
sponding linear constraints until the optimal solu- 
tion of the relaxed problem is feasible for the original 
problem. They solved an example with five periods 
and six different types of work schedules (i.e., six 
integer-valued decision variables). 

Our aim in this paper is to extend their method- 
ology to the multiskill setting, explore the difficul- 
ties encountered with larger problem instances, and 
develop (heuristic) methods to deal with these prob- 
lems in a practical way. Some of the difficulties 
encountered are specific to the multiskill setting. With 
our improved methodology, we can solve (approxi- 
mately) much larger problem instances than with the 
original methodology of Atlason et al. (2004). 

The remainder of this paper is organized as follows. 
In §2, we formulate the staffing and scheduling prob- 
lems in a multiskill center. In §3, which draws largely 
from Atlason et al. (2004), we first outline the solution 
methodology, based on simulation and cut genera- 
tion. Then, we explain several difficulties encountered 
in applying this methodology and how we get around 
them to solve realistic problems. In §4, we report on 
numerical experiments for staffing problems over a 
single period, in which we assume that the system is 
in steady state. We solve problems of various sizes, 
from an artificial example with five call types and 12 
skill groups, to an example of a large real-life call 
center with 65 call types and 89 skill groups. Section 
5 contains concluding remarks. The online appendix, 
which is provided in the e-companion, contains addi- 
tional experimental results. 1 

Staffing a call center in steady state is not quite 
the same as the full-scheduling problem (PI) that we 
formulate in the next section. However, solving large 
instances of it is a first step toward solving large 
instances of (PI). We have also solved scheduling 

1 An electronic companion to this paper is available as part of the 
online version that can be found at http://mansci. journal. informs, 
on-/. 
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problems with several periods, with more than 1(H) 
types of work schedules, but only two types of agents, 
in the context of a call center operating in blend mode 
(with inbound and outbound calls), with the method- 
ology described here. 

2. Problem Formulation 

We now give a nonlinear integer programming for- 
mulation of a scheduling problem with service-level 
constraints, over a time interval divided into periods. 
There are K call types, / skill groups, P periods, and 
Q types of work schedules (which we also call shifts). 
For example, the scheduling could be for one day (or 
week) and the periods could be successive half hours 
of operation of the center over that day (or week). 
Each shift specifies the time when the agent starts 
working, when he or she finishes, and all the lunch 
and coffee breaks. 

The cost vector is c = (c u ,... / c 1 q,...,C| 1/ ... / 
Cj q) ! , where c { q is the cost of an agent of type i 
having shift q. The vector of decision variables is x = 
(x l ,1,..., r 1#Q ,...,;r M ,... ,x lQ )\ where x iq is the 
number of agents of type i having shift q. We 
use the vector of auxilian/ variables y = (yi.i,..., 
Vi.P' - • • / Via* • • • / Vi, p)'/ where y ip is the number of 
agents of type ; in period p. This vector y satisfies 
y = Ax, where A is a block diagonal matrix with I 
blocks A, where the element (p, q) of A is 1 if shift q 
covers period p, and 0 otherwise. 

The constraints are on the service level, defined as 
the fraction of calls answered within a specified time 
limit. This time limit was taken as 20 seconds in our 
computational experiments. The service level for call 
type k in period p is defined as 

gk.piy) 

E[no. of calls answered within s kp seconds in period p] 
E[no. of calls in period p] 

for some constant s k p . It is equal (with probability 
one) to the fraction of calls of type Jc answered within 
s k seconds over an infinite number of independent 
and identically distributed (i.i.d.) copies of period p. 
This fraction generally depends on the staffing over 
all periods, up to period p, because reducing the 
staffing over a given period can increase the queue 
lengths and thus reduce the service level in the peri- 
ods that follow, and may also increase the waiting 
time of calls answered in this given period but that 
arrived in earlier periods. This is why g k p is written 
as a function of y. In our model, the abandonments 
that occur before the time limit s k p are not counted, 
whereas the calls that abandon after the time limit 
are counted in the total. The service-level constraints 
are of the form g k , p (y) > l kp , where each constant l kp 
belongs to the interval [0, 1). 



The model also has constraints on the aggregate 
service level over a given call type, a given period, 
and overall. For example, the aggregate service level 
over call type k is the expected total number of calls 
of type k answered within some time limit s A over 
the day (say), divided by the expected total number 
of calls of type k received over the day. We denote 
by g p (y), gk(y), and g(y) the aggregated service levels 
for period p, call type k, and overall, respectively. The 
corresponding time limits are s p , s k , and s, and the 
corresponding minimal service levels are l pf l k , and /. 

These functions g 9 (the ratios of expectations) are 
generally unknown and much too complicated to be 
evaluated exactly. They can be either approximated 
via simplified queueing models (e.g., as in Ingolfsson 
et al. 2003) or estimated by simulation. In this paper, 
we concentrate on the second alternative. 

With this notation, we define the scheduling prob- 
lem as 

1 Q 

min c i x = J2J2 c i f q x i. q 

subject to Ax = y, 

gk.piy) >h, P for all ^p, 

gpiy) > l v for all p, 

g k (y) > l k for all k, 

g(y)>i, 

x > 0, and integer. (PI) 

The staffing problem is a relaxation of the schedul- 
ing problem where we forget about the admissibil- 
ity of schedules and just assume that any staffing 
is admissible. The cost vector in this setting is c = 
(c 1§ „ . . . , c, P , . . . , c lt u . . . , c, rP ) x , where c ip is the cost 
of an agent of group i in period p. We have 

i p 

min c t y = EE c ., P yi,;. 
i=i p=i 

subject to g kiP (y)>l k , p foralU',p, 
gp(y)Zl p for all p, 
ft(y)>It for all k, 

g(y)>i, 

y > 0, and integer. (P2) 

In the special case where we consider one period 
at a time, we have c = (cj, ...,Ct) x , where c, is the 
cost of an agent of type i, and y = (y x , . . . , i/,) 1 , where 
y, is the number of agents of type /. In this con- 
text, we often assume that the system is in steady 
state over the given period (but we may also assume 
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arbitrary initial conditions). The optimization prob- 
lem then reduces to 

i 

min c*y = 

i=i 

subject to ^.(y) > l k for all k, 

y > 0, and integer. (P3) 

Our numerical illustrations in §4 are with problem 
(P3) for a steady-state system. Solving this problem in 
itself can bring useful insight into the effect of skill-set 
selection and routing strategies on the service level. 

It is frequent practice in call centers to solve the 
single-period staffing problem independently for each 
period, under a steady-state assumption (first step), 
and then find an admissible schedule that covers the 
required staffing at minimal cost (second step). When 
this approach is used, we usually have / p = l k = l p k 
for all p and k, so the constraints on the aggregated 
levels are automatically satisfied if the constraints of 
(P3) are satisfied for each period and if the periods 
are assumed independent. Often, in practice, the con- 
straints are only on the aggregated service level over 
all the periods, and we use these same constraints in 
all copies of (P3), i.e., for each period. This two-step 
approach is definitely suboptimal and the suboptimal- 
ity gap can be significant (e.g., Jennings et al. 1996), 
but it is nevertheless often used because the original 
scheduling problem (PI) is deemed too hard to solve. 
If the staffing vector y* represents the optimal solu- 
tion obtained in the first step, and if we assume that 
this y* automatically satisfies the constraints of the 
original problem (i.e., that the time-aggregated con- 
straints are redundant), then the problem to solve in 
the second step is 

/ Q 

1=1 q=\ 

subject to Ax > y* , 

x > 0, and integer. (P4) 

To solve any of these problems, we need to approx- 
imate or estimate the functions g m . For this, we tried 
variations of a queueing approximation proposed by 
Koole and Talim (2000) and Koole et al. (2003), where 
the model is approximated by a loss system (with sev- 
eral simplifications) and a heuristic is used to infer 
the service levels in the original model from the loss 
ratios in the loss model. We compared the results with 
those of a detailed simulation model and found signif- 
icant differences in the service levels (more than 100% 
in some examples). In the remainder of this paper, we 
use only simulation to estimate the service level. 



3. General Methodology 

In this section, we summarize the main ideas of 
the general methodology proposed by Atlason et al. 
(2004), adapt it to the multiskill setting, point out sev- 
eral difficulties encountered with this methodology 
(many of them were already recognized in Atlason 
et al. 2004), and propose practical heuristics that can 
make the methodology viable for realistic problem 
instances. 

3.1. Optimization of a Sample Problem 

The service levels involved in the constraints are esti- 
mated by simulation. Suppose that we simulate the 
center n times, independently, over its P periods of 
operation. Let (o represent the source of randomness, 
i.e., the sequence of all independent U(0, 1) random 
variates that drive the successive simulation runs 
(regardless of their number). We may assume that o> is 
divided into long disjoint subsequences so that the z'th 
simulation run always starts using random numbers 
at the beginning of the ith subsequence regardless 
of the model parameters and staffing vector. This is 
implemented by using random number packages that 
provide multiple streams and substreams (L'Ecuyer 
et al. 2002, L'Ecuyer 2004). 

The empirical service level over n simulation runs, 
defined as the observed number of calls answered 
within the time limit divided by the total number of 
calls, is a function of the staffing level y and of a>. We 
denote it by G„ k p (y, w) for call type k in period p; 
G„, r (y, *>) aggregated over period /?; G„ *(y, <d) aggre- 
gated for call type k; and G„(y, o>) aggregated over- 
all. For a fixed co, these are all deterministic functions 
of y. To compute these functions at different values 
of y, we simply use simulation with common random 
numbers, i.e., make sure that the same random num- 
bers are used at the same place for all values of y 
(Law and Kelton 2000). 

If we replace the functions g % in the optimization 
problems by their sample counterparts G # , we obtain 
the following empirical scheduling problem, which is a 
sample average version of the original problem: 

/ Q 

min c l x = ^E c t,^, 9 

subject to Ax = y , 

Gn.k.p(y)>k. P for a11 k >P' 
G n p (y)>l p for all p, 

G n .k(y)>k for all/:, 
C„(y)>/, 

x > 0, and integer. (P5) 
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A similar formulation can be given for the other 
problems, replacing g by G everywhere. The single- 
period staffing problem becomes 

min c'y^XXv, 
subject to G wJt (y) > / fc for alt Jc, 

C„(y)>/, 

y > 0, and integer. (P6) 

We insist on the fact that when w is fixed, these 
sample problems become purely deterministic. These 
are the problems we solve, instead of the original 
problems (PI) to (P3). 

We can study the convergence of the optimal solu- 
tion of the sample problem to that of the original 
problem as n -> oo, under the assumption that o) is 
really an infinite sequence of i.i.d. random variables 
(as opposed to pseudorandom numbers). This was 
done by Atlason et al. (2004) for their model, and ear- 
lier, e.g., by Vogel (1994) in a more general setting. 
Our argumentation is similar but simplified. 

In our context, it is reasonable to assume that the set 
y of potential solutions y is finite (e.g., it suffices to 
impose an upper bound on the total number of agents 
in the center). Let y* be the set of optimal solutions 
of the exact problem. Another reasonable assumption 
is that no service-level constraint is satisfied exactly in 
the original problem for these solutions y € y\ Let y* 
be the set of optimal solutions of the sample problem 
based on n simulation runs. Each random variable 
G„,*, p (y, w) estimates the ratio of expectations g k p (y) 
by the ratio of two averages of n Li.d. random vari- 
ables having the correct (positive) expectations and 
finite variance. Therefore, G„ k p (y, o>) converges to 
Sk, P (y) with probability one when n — > oo. This means 
that for every 6 > 0, there is a random variable N 0 < 
oo such that \G n kfP (y, <o) - g kiP (y)\ < e for all n > N 0 . 
Let 8 be the smallest absolute difference between an 
exact service level g % and its corresponding threshold 

among all solutions y 6 y. In view of the above 
assumptions, because y is finite, there is a random 
K such that |G nJt p (y, <o) - g kiP (y)\ < 8 for all n > N # , 
all y € y, all Jc, all p, and all aggregate service levels 
as well. Then, for all n > N„ the sample problem has 
exactly the same set of feasible solutions (and there- 
fore the same optimal solutions) as the exact problem. 
We have just proved the following: 

Proposition 1. With probability one, there is an inte- 
ger N, < oo such that for all n> N 0 , y* = y\ 

The next proposition gives a convergence result in 
the sense of large deviation theory, under the assump- 
tion that the service-level estimators satisfy a standard 



large deviation principle (see, e.g., Ellis 1985, Shwart* 
and Weiss 1995). This assumption holds under fairly 
general conditions that should practically always hold 
in our setting. It typically holds with k = fc 0 e 2 for 
every e small enough, for some constant k 0 > 0. 

Assumption 1. For every € > 0, there are positive inte- 
gers n 0 and k such that for all n > n 0 and y e y, 

P(|G n . t . p (y,a,)-^, (( (y)|>e)<r-" k 

for all k, p, and for the aggregate service levels as well. 

Proposition 2. Under Assumption 1, there is a posi- 
tive constant a and an integer n+ < oo such that for all 
n > n\ 

p[y:=y*]>i-ae-»*. 

Take e = 8 in the assumption. If the absolute error 
|G„ .k.ptyf <°) - gk ,p(y)\ i s less man 8 simultaneously 
for all constraints and all y € y, then the sample prob- 
lem has exactly the same feasible set as the exact prob- 
lem and therefore y* = y\ But the probability that 
this does not happen is bounded by the sum over all 
constraints and all values of y of the probabilities that 
it does not happen for this particular constraint and 
value of y. Each such probability is bounded by e~ nK 
from the assumption and there are a finite number 
of them, so their sum must be bounded by ae~ nK for 
some constant a. 

The proof of the proposition also implies that the 
sample and exact problems have exactly the same set 
of feasible solutions with a probability that converges 
to one exponentially fast when n -> oo. It reuses an 
argument from the proof of Proposition 2 of Yakowitz 
et al. (2000). 

For solving the staffing problem (P3) in the steady- 
state case, we use the sample problem (P6), where the 
functions G n k and G n are estimated by simulating the 
model for n hours of operation. These functions con- 
verge to g k and g and obey a central-limit theorem 
(pointwise) when n — ► oo in the same way as for the 
multiperiod finite-horizon model, if we assume that 
the system has enough ergodicity. Then, if we restrict 
the search to a finite subset of the space of stable solu- 
tions y, Propositions 1 and 2 hold in this case as well. 

3.2. Integer Linear Programming with Cut 
Generation 

We solve the sample staffing and scheduling prob- 
lems by linear programming with cut generation, in 
a manner similar to Atlason et al. (2004). The idea 
is to relax the nonlinear service-level constraints and 
add progressively linear constraints until the optimal 
solution satisfies all service-level constraints. 

We give the following explanations on cut genera- 
tion with the global service-level function g, to sim- 
plify the notation. The same reasoning applies as well 
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to the service level per call type and /or per period, 
and to their sample counterparts. 

At any given step of the algorithm, let y denote 
the current solution (optimal for the relaxed problem 
with the current set of constraints). If y satisfies all 
service-level constraints, then it is an optimal feasible 
solution for our problem and we are done (assuming 
that we did not cut out the optimal solution(s) when 
adding linear constraints). Otherwise, take a violated 
constraint, say g{y) < /, and suppose that g is con- 
cave in y. (A discrete function g is called concave if 
the upper boundary of the convex hull of the points 
( v 'S(y)) is a concave function.) Let q be any subgra- 
dient of g at y, i.e., a vector that satisfies 

g{y)<g(y)+q\y-y) (i) 

for all y. We want g(y) > /, so we must have / < g(y) < 

qV>9 l y + '-S(y), (2) 

where the term on the right is a constant that can be 
readily computed. Adding this linear cut inequality to 
the constraints removes y from the current set of fea- 
sible solutions, without removing any solution that is 
feasible for the original problem if g is concave. 

Unfortunately, g is not always concave in our set- 
ting. Typically, it is concave in the areas where y 
is large, but not where y is small (this was already 
pointed out by Atlason et al. 2004; see also the next 
section). But it is quite reasonable to assume at least 
that g is nondecreasing. In that case, a subgradient 
q must necessarily satisfy q > 0 because taking y = 
y + in (1) gives q } > g(y) - g(y) > 0 for each /, and 
the next proposition provides a sufficient condition 
ensuring that constraint (2) cannot remove feasible 
solutions of the original problem. 

Proposition 3. Suppose that g is nondecreasing every- 
where in its domain and concave in the area (slice) A defined 
by the tuny linear inequalities 

I'y^qV^q'y+'-sCy). (3) 

If a vector q > 0 satisfies {\)for y e A, then the cut (2) can- 
not remove any feasible solution of the original problem. 

If q > 0 satisfies inequality (1) in A, then g(y) < / 
everywhere in the interior of A, so the cut cannot 
remove any feasible solution there. We also have 
g(y) < I everywhere in the half-space where (2) is not 
satisfied because g is nondecreasing in that half-space, 
and every point y in that half-space satisfies y < y x 
for some point y^ € A. 

The area A is the slice of space between the hyper- 
plane that determines the cut (2) and a parallel 
hyperplane that passes through the point y. We call 



a vector q > 0 that satisfies (1) for y € A a ^subgradi- 
ent at y. 

Cuts of the form (2) can be added for all violated 
constraints simultaneously in a single step, or for only 
some of the violated constraints. The next issues are: 
How do we obtain a subgradient, how can we make 
sure that g is concave, and what do we do if it is not 
concave? They are all addressed by heuristics. 

3.3. Difficulties and Heuristic Cures 

3.3.1. Getting a Subgradient. To obtain a (tenta- 
tive) A-subgradient q at y, we simply use forward 
finite differences as follows. For / = 1, . . . , JP, choose 
an integer dj > 0. Compute the function g at y and 
at y + d } ej for / = 1, — JP, where e ; is the /th unit 
vector, with a one in position / and zeroes elsewhere. 
We define q as the /P-d imensional vector whose /th 
component is 

^ = U(y+^)-g(y)]/^ (4) 

for ail /. If we are sure that g is concave, taking dj = 1 
for all / is certainly the most natural choice, but the 
more general form with d } > 1 is sometimes conve- 
nient, e.g., when the sample function is not smooth 
enough and /or not concave (we will return to this). 
If g is nondecreasing (which we assume), then q > 0. 
If g is concave and dj = 1, then cjj is a subgradient for 
g viewed as a function of alone (a one-dimensional 
function with discrete domain) for each /. Unfortu- 
nately, even under these conditions, q is not neces- 
sarily a A-subgradient of g because (1) may fail to 
hold in some diagonal direction (i.e., for some vector 
y — y having several positive components). Atlason 
et al. (2003) pointed out this important issue and 
gave examples; we provide a similar discussion for 
completeness. 

A two-dimensional function g may satisfy g(0, 0) = 
g(l, 0) = #(0, 1) = a and g(l, 1) > a and yet be con- 
cave. For example, g(y\,yi) can be defined by two 
planes that match these values, one plane for y x > y 2 
and another plane for y 1 < y 2 . In this case, we would 
get q = (0, 0) and this vector would not be a sub- 
gradient. The corresponding cut may then remove 
solutions y that do not satisfy (1) and do satisfy the 
service-level constraints of the original problem. So, 
we can only view q as a heuristic guess for a sub- 
gradient. On the other hand, in our experiments, we 
routinely checked (1) in some positive diagonal direc- 
tions, e.g., with each component of y — y equal to one 
or two, and never found any serious problem in the 
cases where g was concave with respect to each of its 
coordinates separately. 

Suppose that g satisfies the submodularity condition 

g(y + d> + d 2 ) + g (y) < g (y + d,) + g(y + d 2 ) (5) 
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for all nonnegative integer vectors d } and d 2 . For the 
functions g 9 in our setting, this condition means that 
adding a vector d 2 of agents to the center brings less 
gain if the current (vector) number is y + d, than if it is 
only y (i.e., the larger the current staffing, the smaller 
the marginal gain of adding more agents). It is quite 
reasonable to expect this to be true, at least when y 
is large enough. Under condition (5), g is necessarily 
concave and the vector q defined by (4) with d } - 1 
for all / always satisfies (1) for y > y. However, it is 
not necessarily a A-subgradient because (1) may fail 
to hold for some vectors y such that y - y has both 
positive and negative components. This was pointed 
out in Atlason et al. (2003). 

In general, if g is not concave or if q is not a subgra- 
dient for some reason, the cut (2) may remove part of 
the set of feasible solutions (in terms of service level) 
of the original problem. If this includes the optimal 
solution(s), we will simply end up with a suboptimal 
solution. For this reason, it is generally a good idea 
to run the algorithm more than once with different 
streams of random numbers and /or slightly different 
parameters, and retain the best solution found. 

3.3.2. Nonconcavity and Related Problems. A 

second (related) issue concerns the concavity of g. 
Typically, g is a convex function of each coordinate 
of y when these coordinates are small, and a concave 
function when the coordinates are large. The ratio- 
nale is that if there are too few agents, almost no 
call is answered within the time limit and adding one 
agent is not likely to change this by much (at least 
for a large center). But if we keep adding agents, 
at some point the fraction g(y) increases at a faster 
rate. Eventually, when there are enough agents so that 
most calls are answered within the time limit, adding 
more agents has little effect on g(y). Figure 2 gives 
a one-dimensional illustration of this. The function 
g(y) has an S-shape. A similar illustration appears in 
Atlason et al. (2003) and this was observed earlier by 
Henderson and Mason (1998) and Ingolfsson et al. 
(2003). For the special case of an M/M/s queue with 
arrival rate A and service rate ji, the concavity of the 
service level as a function of 5 was proved for s > A//i 
by Jagers and van Doom (1991). 

Figure 2 The S-Shaped Service-Level Function 
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In Figure. 2, suppose that / = 0.8 and that we use 
dj = 1 to get our subgradients. The optimal solution in 
this one-dimensional case is the smallest y for which 
#(y) ^ U- 8 ' namely, y =13. It corresponds to point S 
in the figure. If y = 9, where g is concave, the sub- 
gradient q is the slope of the line that passes through 
points C and D in the figure. This line intersects the 
horizontal line g(y) = / = 0.8 at y = 11.95 (point E in 
the figure). The corresponding cut to be added, qy > 
W + ' ~ g(9), can be written as y > 11.95, or y > 12 
because y must be integer. This constraint removes 
only infeasible solutions, i.e., values of y for which 
g{y) < 0.8. On the other hand, if y = 6, we see that 
g is not concave in that area. The subgradient q at 
y = 6 is the slope of the line going through points A 
and B (the lowest dotted line), which intersects the 
line g(y) = 0.8 at y = 15.20. The cut-generation pro- 
cedure would then add the constraint y > 15.20, or 
y > 16, which eliminates a large chunk of the feasible 
set, including the optimal solution. 

Proving the joint concavity around the current solu- 
tion appears much too difficult and impractical, so we 
only check the univariate concavity of g with respect 
to each coordinate y } separately. More precisely, we 
check if q } is a nondecreasing function of d )f by veri- 
fying if 

^(y+(^+i)e;)-sr(y)<^ + i) 

for each ;'. These are only necessary concavity condi- 
tions. Atlason et al. (2004) use a different type of con- 
cavity check; it is multivariate and uses all function 
evaluations made so far. 

These (imperfect) concavity checks are expensive 
because they require several additional simulations 
(at y + (dj + l)e ; ). One possibility is to simply bypass 
them altogether, to save time, at the risk of cutting out 
too much of the feasibility check. To reduce that risk, 
it is a good idea to take larger values of dj when the 
current service level is much smaller than its minimal 
value specified by the constraints. In our computa- 
tional experiments, we found this to be more efficient 
than checking concavity, in the sense that it usually 
yields a very good solution in roughly half the time. 
For the experiments reported in §4, we took d } = 3 
when the service level corresponding to the consid- 
ered cut was less than 0.5, dj — 2 when it was between 
0.5 and 0.65, and dj = 1 when it was greater than 0.65. 

3.3.3. Initial Constraints and What to Do When 
We Get Into Areas of Nonconcavity. At the begin- 
ning of the algorithm, if we relax all the nonlinear 
service-level constraints, the optimal solution of the 
relaxed problem is y = 0. The functions g are obvi- 
ously nonconcave at that point, so we are in trouble 
right from the start! We must find a way of eliminat- 
ing areas of nonconcavity by adding other types of 



Cezik and L'Ecuyen Staffing Multiskill Cull Crntcrs via Unrar /tiynijr/m/rt? ami Simulation 
MrtiW}*.otn»'nt Sciuuv f>4(2). pp. 3KK*2.\ <?>2lXW INFORMS 



317 



Figure 3 Covering Load by Skill Supply 
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constraints to restrict the set of admissible solutions a 
priori, before generating any cut based on a subgra- 
dient. That is, impose y e y for some set °tj in which 
the functions g m are more likely to be concave. 

We obtain such constraints as follows. For any 
given period, we impose that the skill supply of 
agents during that period can cover a fraction a k of 
the load of call type k during that period for all Jfc, for 
some constants a k that are usually close (or equal) to 
one. To explain what this means, we assume for sim- 
plicity that call type k has arrival rate X k and service 
rate fi k (independent of the agent group) in a given 
period, and that the number of agents of each group 
for that period is given by the vector y = (y t , . . . , yrf. 
The load for call type k is p k = X k /fi k . 

Consider the graph of Figure 3, where there is 
an arc of infinite capacity from call type k to agent 
group ; if and only if this agent group can handle 
call type k. Each arc going from an agent group i 
to node B on the right has a capacity equal to the 
number of agents in that group. Each arc going from 
node A to a call type k on the left has capacity equal to 
a k p k , the load that we want to cover for that call type. 
We compute the maximal flow that can go from A to 
B in that graph. If we view the load of each call type 
as fluid, the flow that goes through call type k and 
agent group i corresponds to the load of call type k 
handled by agent group i, i.e., the number of agents 
of group i required to handle that load if these agents 
are busy 100% of their time. This is a real number (not 
necessarily an integer) and we call it the skill supply 
devoted to call type k. If the maximal flow equals 

K 

-del ^ 

this means that there is enough skill supply to handle 
the desired load for each call type. Otherwise, this 
is not possible. This is summarized in the next 
proposition. 

Proposition 4. The maximum flow in the graph of 
Figure 3 is p if and only if there is enough skill supply in 



the agent groups to cover a load of tx k p k for call type k, 
simultaneously for all k ( for a total load of p). 

For any given solution y = y, the maximum flow 
in the graph can be computed easily by a standard 
max-flow algorithm. If the flow is less than p, the 
algorithm finds a minimum cut that provides a valid 
linear inequality on y needed to reach the required 
maximum flow, but violated by the current solution. 
We add this constraint and find a new solution y. This 
is repeated until the maximum flow reaches p. 

This procedure is performed at the beginning of 
the algorithm and is repeated each time we obtain 
a new solution y after having added linear inequali- 
ties based on service-level constraints. The computa- 
tional requirement for solving this max-flow problem 
on a sparse bipartite network such as ours is almost 
insignificant in comparison with the overall comput- 
ing time, regardless of the values of K and / (which 
may reach 100 in large call centers). In our compu- 
tational experiments, we took all a k s equal to one, 
except for a situation that we will describe in a 
moment. 

It is important to underline that this procedure is 
a heuristic, in the sense that being able to handle the 
load (or a given fraction of it, as specified above) 
is neither necessary nor sufficient for the model to 
be stable. It is not necessary because abandonments 
can keep the system stable if the load cannot be cov- 
ered. Even without abandonments, if the load is not 
covered during a given period, a queue will build 
up during that period but these calls could be han- 
dled during the next period, in which there could be 
extra capacity. These are the reasons why we may not 
always take a k = 1 for all k. In the case of steady-state 
models without abandonments, however, we should 
take a k > 1. But even if the maximum flow exceeds p 
with a k > 1 for all k, i.e., if there is enough skill sup- 
ply to simultaneously cover more than the load for 
each call type, this does not necessarily imply that the 
system is stable (see Example 1 in §4, and Garnett and 
Mandelbaum 2000, for counterexamples). Stability in 
this case generally depends on the priority rules in 
the routing strategies. The procedure is nevertheless 
very useful to cut out areas of nonconcavity of the 
functions g 9 . 

Even when the skill supply can cover the load of all 
call types, it may happen that certain call types have 
extremely low service level if the routing rules give 
them low priority. When we want to generate a sub- 
gradient cut based on the service-level threshold for 
such a call type, we run into a nonconcavity problem. 
The "subgradient" is often nearly flat (zero or almost 
zero) and the corresponding cut would remove all 
the good solutions. In this case (which typically hap- 
pens for highly unbalanced systems, where certain 
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typos of calls have very low priority but yet require 
a minimal service level), we need a different type of 
heuristic. What we do is simply increase the vaJue of 
a k used in the max-flow problem for the call type k 
having the highest service-level gap, i.e., the small- 
est value of g kf ,(y) - i kfP or g k (y) - If.. The trick is 
to increase a k just enough to get out of the noncon- 
cavity area. An appropriate value could be found by 
trial and error, checking the concavity and the service 
levels obtained for each trial In our computational 
experiments, we adopted a heuristic that increases a k 
so that the resultant service level for call type k is 
between 0.01 and 0.1. For the examples that we tried, 
this always appeared sufficient to get rid of these flat 
subgradients, and taking a larger a k was counterpro- 
ductive (empirically). 

When very low service levels are obtained for cer- 
tain types of calls even if the global service level is 
high, this may indicate poor routing strategies. In that 
case, significant improvements on the service level 
per call type can often be obtained by changing the 
routing rules, which is likely to be less expensive than 
adding more agents. 

3.3.4. Cut-Generation Priorities. The problem of 
certain call types having zero (or near zero) service 
level happens more frequently when the global ser- 
vice level is still low as well. Moreover, when the 
global service level is higher, the service level for a 
given type of call tends to be concave over a wider 
range of values than when the global service level 
is low. We observed empirically that in many cases, 
when the global service level is high, the service level 
of a given call type is concave as a function of a given 
y ; practically as soon as it becomes positive. 

For this reason, as long as the global service level 
is below a given threshold we only generate cuts 
based on the global service-level constraint. After that 
threshold has been reached, at each step of the algo- 
rithm we generate and add a cut for each service-level 
constraint that is not satisfied. All these constraints 
are added simultaneously. Adding several constraints 
at the same time generally makes the algorithm more 
efficient because it reduces the total number of steps 
and thus the number of simulations runs. On the 
other hand, adding bad cuts (obtained in nonconcave 
areas) can be damaging. So, there is a compromise to 
be made and the choice of i m decides on that compro- 
mise. In our experiments, that threshold was 0.65. 

3.3.5. Problems with the Smoothness of Sam- 
ple Functions. Yet another difficulty comes from the 
fact that the sample functions G n % are typically less 
smooth than their "expected value" counterparts g m . 
This was also noticed by Atlason et al. (2004). Because 
of that, G n t% is often locally nonconcave in the area 
where g, is concave. The explanation is that for a 



Figure 4 Sample Service-level Function of a Call Type (From the 
Large Example) When All Except One Agent Groups are Fixed 
and the Global Service Level Is Greater than 0.65 
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given sample path, if we increase the number of 
agents one at a time, the actual number of calls 
that get served within the time limit may increase 
sometimes by a large amount, sometimes by a small 
amount, sometimes not at all, and the amount of 
increase is not necessarily monotone, for a small n, 
even if it becomes monotone when n -* oo. When we 
increase the sample size n they become smoother, so 
the problem diminishes, but it may take a very large 
value of n before it disappears completely, and that 
value is random. Increasing n also makes the simula- 
tion much more expensive. 

This is illustrated in Figure 4, which shows the 
actual sample function G n k for a given call type k as 
a function of the number of agents of a given group 
that can handle that call type, with the number of 
agents in all other groups held fixed, in a simulation 
of 50 hours of operation of the large example of §4. 
The sample function is flat between 6 and 7, between 
22 and 23, and between 24 and 25. If we use d ] = 1 
in this case, at y = 6 and 22 (points A and B in the 
figure), the subgradient is zero and the corresponding 
cut removes everything! The subgradient is also zero 
at y = 24 (C in the figure), but the constraint is already 
satisfied at that point. At D and E, the subgradient is 
not zero, but it is small enough so the corresponding 
cut would impose an excessive lower bound on i/. 

This type of problem can be detected by univariate 
concavity checks. When it occurs, we could simply 
use larger values of d } in the subgradient estimators. 
In the example in the figure, for example, taking dj = 2 
or 3 would do fine. A sensible heuristic could be to 
try dj = 1, 2, and 3, and take the one that gives the 
largest slope. When the service level G n . at the cur- 
rent solution is still much smaller than the threshold, 
it is usually a good idea to start directly with some 
dj > 1. In our computational experiments of §4, we 
simply bypassed all concavity checks (to save time) 
and directly used larger values of dj as explained 
earlier. 

With all these heuristics, subgradients close to zero 
are still encountered once in a while and we must 
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avoid adding the corresponding cuts. For that, we 
use another threshold: when all coordinates of q are 
smaller in absolute value than this threshold, we do 
not add this cut. The other cuts added at the same 
step will usually take care of improving the situa- 
tion enough to circumvent the problem. This thresh- 
old was set at 0.01 in our experiments. 

3.3.6. Relaxing Integrality and Rounding Up. 

For problem instances where the vector y has a small 
dimension, we can solve the sample problem at each 
step (i.e., after adding new constraints) by integer pro- 
gramming (IP). The simulation time (to evaluate the 
service levels required to compute q) typically dom- 
inates the total CPU time in that case. But when the 
problem becomes too large, the time required for solv- 
ing the integer program becomes excessive (this time 
increases exponentially with the size of the problem) 
and we must find an alternative. 

What we do in that case is simply relax the integral- 
ity constraints: at each iteration, we solve the stan- 
dard linear programming (LP) problem instead of the 
IP problem. Then, we round up each component of 
the solution to an integer. Obviously, all constraints 
satisfied by the optimal LP solution are also satis- 
fied by the rounded-up vector. We then run simu- 
lations with this rounded-up solution to check the 
service-level constraints, check concavity, and esti- 
mate a subgradient. 

At the last iteration, when we find a rounded- 
up solution that satisfies all service-level constraints, 
we perform a local search around that solution to 
see if it can be easily improved. For the numerical 
experiments reported here, the local search was rather 
simplistic: we just decreased each component of the 
vector y by one, by decreasing order of agent cost, 
until we reached infeasibility. This local search proce- 
dure could certainly be improved. 

We also experimented this approach on medium- 
sized problems, where IP was practical but yet took 
a significant fraction of the total CPU time. With the 
LP+rounding approach, the time for solving the LP 
was almost negligible compared with the simulation 
time (so the overall algorithm was faster) and the 
results were generally within 1% of the IP results. 

4. Computational Experiments 

The numerical examples considered here are for a sin- 
gle period only and we solve the staffing problem 
under the assumption that the system is in steady 
state. For each k, the arrivals are assumed to be from 
a stationary Poisson process with rate A*, while the 
service times and patience times are exponential with 
respective rates fi k and i> k . All these rates are in calls 
per hour. 



For all experimental results reported here, the 
thresholds and parameters of the heuristics were 
selected as mentioned in the previous section. Thus, 
all examples were solved with the same algorithm. 
Our computer had a 2.0 GHz AMD Athlon 64 Pro- 
cessor 3, 200+ running Linux, SUN Java Develop- 
ment Toolkit 1.4.2, and CPLEX 8.1 was our LP/IP 
solver. The simulations were performed using the 
contact center simulation package described in Buist 
and L'Ecuyer (2005), developed in Java in the frame- 
work of the SSJ simulation package (L'Ecuyer 2004). 
In our simulations, we always start the system full (all 
agents busy but no call in the queues). At each value 
of y, we perform a single simulation run of length 
r+7720 (in the simulation time frame) for some con- 
stant 7, divide it into 21 batches of length T/20, 
and discard the first batch. The 20 other batches are 
used to compute estimates and confidence intervals 
for the service levels, globally and for each call type. 
Unless stated otherwise, T = 50 hours, so we have 
2.5 hours of warm-up and 50 hours of simulation. For 
the smaller examples, we tried 7-500 hours for com- 
parison. The results differed by less than 1% except 
for a single case where the longer simulation found a 
much better solution (see Example 1). The total com- 
puting times are approximately proportional to T. 

4.1. Staffing a Center of Moderate Size 

We first consider two examples of moderate size, one 
with five call types and 12 agent groups, the other 
with 20 call types and 15 agent groups. Example 1 is 
adapted from Koole and Talim (2000). 

For the routing, we assume that each call type has 
an ordered list of agent groups, used to select an avail- 
able agent upon arrival. If all agent groups in that 
list are busy, the call joins a queue (one FIFO queue 
per call type). Each agent group has an ordered list 
of call-type queues. When an agent from this group 
becomes available, it selects the first nonempty queue 
from its list and picks the first call in the queue. 

Example 1. In our first example, there are five call 
types and 12 skill groups, defined by the rows and 
columns of Table 1, respectively. The priorities are 
very simple: an arriving call checks for an available 
agent by numeric order of agent groups and an agent 
that becomes available also looks at call queues in 
numeric order. Such simplistic priorities make the 

Table 1 Skill Groups for Eiample 1 

k Agent group / 

11 3 4 5 7 8 9 11 12 

2 3 6 7 8 11 12 

3 2 4 6 7 9 10 11 12 

4 5 10 12 

5 8 9 10 11 12 
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Table 2 Results for Example 1 



Aban. I Alga Cuts Subgrad. Sim. CPU (sec.) Obj. OoS OoS-KT QoS per call type 



Staffing vector 



No 


0.0 


IP 


9 


9 


120 


116 


220.9 


0.804 


0.115 


No 


0.0 


LP 


6 


6 


70 


66 


222.1 


0.803 


0.454 


Yes 


0.0 


IP 


10 


10 


129 


114 


219.5 


0.801 




Yes 


0.0 


LP 


4 


4 


57 


45 


220.5 


0.804 




No 


0.5 


IP 


9 


5 


73 


67 


263.2 


0.863 


0.860 


No 


0.5 


LP 


7 


3 


60 


50 


251.9 


0.803 


0.866 


Yes 


0.5 


IP 


35 


29 


365 


280 


2247 


0.801 




Yes 


0.5 


LP 


14 


13 


167 


131 


225.4 


0.809 





(0.99. 0.90, 0.99, 0.80. 0.00) (4. 36. 5. 28. 45. 45. 1. 13. 0. 24. 0. 0) 

(0.99, 0.88. 0.99. 0.84, 0.00) (36. 38. 0. 0, 45. 46. 2. 15. 0. 23. 0. 0) 

(0.99, 0.93, 0.95, 0.84. 0.21) (32, 30, 0. 1. 46, 44, 0, 30. 0, 16. 0. 0) 

(0.99, 0.94, 0.97, 0.88, 0.12) (36, 30. 15, 0, 45, 39, 0. 17. 0, 21 , 0. 0) 

(0.99. 0.88. 0.99. 0.92. 0.59) (0. 30, 0. 24. 0. 0. 0. 0. 0, 65. 0. 92) 

(0.96. 0.51 . 1 .00. 0.94. 0.62) (0. 47. 0. 1 3, 0. 14. 0. 66. 0. 80, 0. 0) 

(0.99, 0.61, 0.99, 0.85, 0.57) (26, 25. 11, 1, 36, 39, 0. 0. 30, 35. 0, 0) 

(0.99. 0.76, 0.98, 0.76. 0.54) (25. 24, 0. 7, 45, 44, 0, 12, 33, 14, 0. 0) 



system highly unbalanced and the problem is then 
more difficult to solve because the service level of the 
low-priority call types tends to be very low (in the 
nonconcavity zone). 

The cost of an agent with s+ 1 skills is assumed to 
be 1 + ks, where k = 0.10. The arrival rates and service 
rates per hour are A 1 = A 3 = A 5 = 440, A 2 = A 4 = 540, 
and fi k = 12 for all k. We require the overall service 
level to be at least / = 0.8. For the values of l k , we 
consider two cases: l k = 0 (no constraint on the ser- 
vice level per call) and l k = 0.5. Regarding the aban- 
donments, we also consider two cases: (1) p k = 0 (no 
abandonments), and (2) v k = 10 for each k. This gives 
a total of four combinations. For each of them, we use 
the original method that solves an IP at each step and 
the method that replaces this IP by an LP and round 
up every solution. The results for the eight cases are 
in Table 2. For each case, the table indicates if there 
were abandonments (aban.), gives the minimal ser- 
vice level per call type (l k = 0.0 or 0.5), says if the IP 
or the LP was solved at each step (algo.), then gives 
the total number of cuts generated by the algorithm 
excluding the initial load-covering constraints (cuts), 
the number of points y where subgradients (global 
or per call type) were estimated (subgrad.), the total 
number of simulations (sim.), the total CPU time in 
seconds (CPU), the global service level for the final 
solution (QoS), the global service level estimated by 
the Koole-Talim approximation (Koole et al. 2003) for 
that same solution (QoS-KT), the service level per call 
type (QoS per call type) for the final solution, and 
the staffing vector y in the final solution. The service- 
level values reported here are accurate to approxi- 
mately ±0.02% with 95% confidence. The Koole-Talim 
approximation can be applied only when there are no 
abandonments. Roughly, we need / + 1 = 13 50-hour 
simulations each time we compute a subgradient. The 
number of times we compute a subgradient is typ- 
ically equal to the number of generated cuts when 
l k = 0 and smaller when l k > 0 because in that case 
several cuts can be added at the same time. 

Except for one case (see below), solving the IP as 
an LP at each step (i.e., using the rounding method) 



increases the value of the objective function for the 
final solution, by about 1% on average, while reduc- 
ing the CPU time roughly by a factor of two or three. 
For a small example such as this one, it may be worth 
spending this additional CPU time to get a better 
solution, i.e., solve the IP at each iteration. For larger 
examples, however, the factor of CPU times between 
the IP and LP cases can be huge, and solving the LP 
eventually becomes the only practical solution. The 
exception is the case where l k = 0.5 and there are no 
abandonments. Then, the IP and LP methods give 
totally different staffing vectors, with a large differ- 
ence in the objective function value, and the LP hap- 
pens to find a better solution than the IP. We reran this 
particular (difficult) case with 500 hours of simulation 
and the IP method found a much better solution, 
with value of 244.3, whereas LP found one with value 
248.2. 

Another interesting observation is that when l k = 0, 
call type 5 has a very poor service level in the 
optimal solution: only a small fraction are answered 
within 20 seconds when there are abandonments 
and none when there are no abandonments. In the 
no-abandonments case, the system is actually unsta- 
ble for call type 5. An infinite queue does build up 
with the retained solution, even if there is enough 
skill supply to cover more than the load of each call 
type simultaneously. (This explains the fact that the 
average of service levels per call type weighted by 
their arrival rates does not match the global service 
level; several calls of type 5 are still in the queue at 
the end of the simulation and are not accounted for.) 
The instability is due to the fact that each agent group 
that can handle calls of type 5 considers them with 
the lowest possible priority, and can handle at least 
two other types of calls with higher priorities. With 
these fixed (and unbalanced) priority rules, when we 
add agents to these groups, these agents first take 
care of other call types before handling calls of type 5 
and there is not enough capacity left to handle their 
load. So, we must add a large number of these (rather 
generalist) agents to really increase the service level 
of this call type. This is why the objective function 
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increases significantly when we impose a minimal ser- 
vice level per call type with l k = 0.5. This increase is 
much more pronounced when there are no abandon- 
ments because the abandonments (of all call types) 
during the periods of high congestion have a large 
impact in improving the service level of call type 5. 
For the case where l k = 0.5 with abandonments, with 
the retained solutions with both IP and LP, the aban- 
donment ratio is approximately 3.5% globally and 
12% for call type 5. Of course, less expensive solu- 
tions could be found if we were allowed to change 
the priority rules of the agents. 

When solving the case l k = 0.5 without abandon- 
ments, we first add cuts only for the global constraint 
(as described in §3.3) and come up with a solution 
with a service level of zero for call type 5. At that 
point, we cannot add a subgradient cut based on the 
service-level constraint of this call type because the 
subgradient is zero. This is a case where we must 
inflate a 5 in the max-flow problem. 

The Koole-Talim approximation for the global ser- 
vice level gives totally unreliable values when the sys- 
tem is highly unbalanced and some call types have 
very low service levels. It behaves much better for bal- 
anced systems. We also observed this in other exam- 
ples and this had been pointed out already by Koole 
et al. (2003). 

Example 2. This is a slightly larger example, with 
20 call types and 15 agent groups, defined by the rows 
and columns of Table 3. The priority rules, cost func- 
tion, abandonment rates, and service-level require- 
ments are as in the first example. The arrival rates and 
service rates per hour are A a = A 2 = 240, A 3 = A 17 = 160, 
A 4 = A 7 = A 9 = A M = A 19 = A 20 = 260, A 5 = A 8 = A 16 = 

Table 3 Skill Groups for Example 2 

Agent group / 



1 


1 




3 




5 




7 




9 














2 


1 




















11 




13 




15 


3 




2 




4 




6 




8 




10 












4 








4 
















12 




14 




5 


1 


2 






5 












11 










6 






3 








7 


8 




10 












7 










5 








9 






12 


13 






8 










5 


6 








10 




12 




14 


15 


9 




2 




4 


5 


6 








10 












10 










5 


6 






9 








13 


14 




11 


1 








5 






8 




10 




12 








12 








4 










9 




11 






14 


15 


13 




2 






5 




7 






10 










15 


14 






3 










8 


9 








13 




15 


15 




2 








6 


7 














14 


16 


1 








5 










10 




12 








17 




2 








6 










11 










18 






3 


4 


















13 


14 




19 




2 












8 








12 






15 


20 






3 






6 




6 








12 


13 


14 





A lg = 130, A 6 = A ir , = 230, A 1P = 125, A„ = 235, A 12 = 
155, A , 3 = 225, and p } = 12 for all k. The results are in 
Table 4. 

Again, the rounding method increases the objective 
function value by about 1% on average, and reduces 
the CPU time by a factor of one or four. The unbal- 
anced priority rules bring the same types of problems 
as in Example 1. The low-priority calls get extremely 
poor service level with l k = 0 and the objective func- 
tion value increases significantly when we impose 
l k = 0.5 and there are no abandonments. For l k = 0.5 
with abandonments, for the retained solutions with IP 
and LP, the abandonment ratio is approximately 3.5% 
globally and higher for the low-priority call types. For 
example, for call types 16, 18, and 20, it is 9%, 10%, 
and 14%, respectively 

4.2. Staffing a Large Center 

We now consider a model inspired by a large real- 
life call center operated by Bell Canada. There are 
89 call types and 65 skill groups. We do not pro- 
vide the details of the arrival, service, and aban- 
don rates, and the call push/pull matrices because of 
space limitation and the proprietary nature of some of 
these details. In summary, the model assumes that the 
arrivals are Poisson with rates A* varying from 1.046 
to 416.6, and the global rate is 3,581.7. The service 
times are assumed exponential with rates fi k varying 
from 0.6777 to 600. The aggregate load (£ k \ k /fi k ) is 
500. The patience times are exponential, with v k = 2.0 
for all k. For the service-level constraints, we take 
/ = 0.8 globally and l k = 0.5 for each call type. 

Both the call types and the agent groups are split 
between two locations. One location has 22 call types 
and 15 agent groups; the second location has 43 call 
types and 74 agent groups. The number of skills per 
agent group goes from 1 to 24. We model the cost of 
an agent with 5 + 1 skills as 1 + ks, where k = 0.05. 

The router uses a set of priority rules named "local 
specialist routing policy," which tries to assign any 
incoming call primarily to an agent based in the loca- 
tion from where the call originates. If more than 
one such local agent can handle this call, the router 
chooses the agent with the smallest number of skills. 
In case of equalities, the agent with the longest idle 
time is chosen. If no local agent is available to serve 
the call, it is put in a queue corresponding to its 
type. When a call has spent more than six seconds in 
the queue, the router tries to assign it to any agent 
from the other location. If the router succeeds, the 
call leaves the queue and is served remotely. On the 
other hand, when an agent becomes free, all the local 
queues are queried for a call and the call with the 
longest waiting time is taken. By using this policy, the 
router behaves as if there was only one queue for all 
call types. If there are still free agents after the queues 
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Table 4 Results for Example 2 



Aban. I> Algo. Cuts Subgrad. Sim. CPU (sec.) Obj. QoS QoS-KT QoS per call type Staffing vector 

No 00 IP 10 10 159 232 4677 0.802 0.079 (1.00.0.99.0.99.0.93,0.98.0.99. (31.0.55.61.0 0,74.21 33 0 54 

0.76. 0.00. 0.61, 0.30. 0.82. 0.97. 12. 0. 0. 0. ) 
0.98. 0.94, 0.94, 0.27.0.88. 0.89 
0.00. 0.46) 

No 00 LP 2 2 40 62 472.6 0.818 0.280 (1.00.0.99,0.99,0.96.0.99.0.99. (30,0.27,68,0.0.78.22.50.0.57 

0.94. 0.00, 0.76. 0.77, 0.86, 0.99, 11, 0. 0, 1)' 
0.98,0.90,0.93.0.37,0.97.0.81. 
0.00. 0.00) 

Yes 00 IP 7 7 125 184 466.6 0.800 (1.00.1.00.0.99,0.95.0.99.0.99. (32.0.23.72.0.0,77,17.52.0 51 

0.93.0.34,0.82,0.80,0.85,0.99, 0,0,1,15) 
0.98.0.91,0.94.0.48,0.96,0.85, 
0.03, 0.07) 

Yes 0.0 LP 7 7 114 164 470.8 0.807 (1.00,0.99.0.99,0.95,0.99,0.99, (30.0,38,72,0,0.77,22,43,0,50, 

0.84,0.15,0.83.0.63,0.87,0.99. 0,0,0,11)' 
0.99, 0.94, 0.96, 0.45, 0.95, 0.92. 
0.03. 0.39) 

No 0 5 ,P 40 23 3 6 2 486 501.6 0.872 0.799 (1.00,0.98,0.99,0.99,0.96,0.99, (46,9,52,63,0,0,42,51,42,0.27, 

0.94, 0.50, 0.91, 0.78, 0.98, 0.96, 13, 0. 10, 0) 
0.86, 0.98. 0.57. 0.75, 0.56, 0.97, 
0.69, 0.78) 

No 0.5 LP 26 9 158 226 500.4 0.868 0.805 (1.00.0.99,0.99.0.96,0.97,0.99. (41,0,41,55.0,0.51.53,56,0,36, 

0.98, 0.50, 0.64, 0.90, 0.98, 0.98, 1 6, 0, 0, 8) 
0.90,0.98,0.61,0.76,0.79,0.90, 
0.71,0.70) 

Yes 05 l p 9 3 57 105 474.0 0.831 (1.00,0.99,0.99,0.90.0.97,0.99, (29,0.9,59,0,0,69,72,40,0,41. 

0.76,0.57,0.62,0.50.0.98,0.95, 0.0,0,20) 
0.98, 0.95, 0.89, 0.55, 0.78, 0.52, 
0.76,0.51) 

Yes 0.5 LP 12 3 54 87 473.4 0.825 (1.00.1.00,0.97,0.83,0.99,0.99, (43,0.45,61.0.0,69,0,40,0,42. 

0.79, 0.91 , 0.50, 0.53, 0.77, 0.99. 0. 0. 0, 43) 

0.99, 0.97, 0.94, 0.53, 0.88, 0.88, 
0.50,0.51) 



are queried, the query is made a second time, allow- 
ing agents to serve calls from the other location. 

We first tried to solve this problem as an IP at 
each step, but the CPU time requirement was unrea- 
sonably high. So, we solved the LP (with rounding 
up) instead. The resolution took 37 minutes of CPU 
time. Less than 1% of this time was used for solv- 
ing the LPs; most of it was used for the simulations. 
Finding the initial solution that satisfies the load- 
coverage constraints took six seconds and generated 
82 constraints. 

We obtained a solution with 526 agents (for a load 
of 500), for which the objective function value was 
669.35. The average number of skills per agent is 
6.45, and the retained staffing vector has 37 nonzero 
entries ranging from 1 to 68. For that solution, the 
50-hour simulation (used by our algorithm) estimates 
a global service level at 0.83 ± 0.02 (with 95% confi- 
dence), the average agent occupancy at 94.5%, and the 
abandonment ratio at 1.1%. We also ran a 5,000-hour 
simulation with this retained solution to check the ser- 
vice levels with better precision. This time, the global 
service level was estimated as 0.845 ±0.002, and three 



call types had estimated service levels slightly below 
0.5, namely, 0.49 ± 0.01, 0.47 ± 0.01, and 0.47 ± 0.01 
(with 95% confidence). After observing this, it would 
be easy to add a few more cuts to increase these three 
service levels and do one or two additional iterations, 
but we did not do that. 

Interestingly, the five call types having the largest 
arrival rates, which account for 42.3% of the total 
call volume, are also those that have the largest ser- 
vice level, all around 0.99, in the retained solution. 
The explanation is that these call types are favored 
by the current (fixed) priority rules. In contrast, the 
sixth most-frequent call type has its service level at 
the lower bound of 0.5, with an abandonment ratio 
of 5.2%. This call type can be handled by a single- 
agent group. This group has 53 agents with 11 differ- 
ent skills and is at a different location than the call 
type, which explains the poor service level. Its occu- 
pation rate is 93.7%, a little below the average. 

It must be emphasized that the retained solution 
is random and may change significantly when we 
vary the sample size and/or the random number 
streams. To illustrate this, we ran the same algorithm 
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with SOM-hour instead of 50-hour simulations, and 
found a solution with objective function value of 664.3 
after about six hours of CPU time. We also replicated 
the algorithm five more times independently with 
50-hour simulations and obtained solutions whose 
values ranged from 663.1 to 693.3, with global QoS 
ranging from 0.84 to 0.89, and CPU times ranging 
from 31 to 59 minutes. 

In this model, the objective function could be re- 
duced significantly by reducing the average number 
of skills per agent and optimizing the skill mixes, 
agent group locations, and routing rules and priori- 
ties. This is beyond the scope of this paper and will 
be addressed in future work. 

5. Conclusion 

We have adapted the cutting-plane methodology of 
Atlason et al. (2004) to the problem of optimal staffing 
of a multiskill call center. We introduced several 
heuristics to make the approach practical for large 
problem instances. These heuristics include, for exam- 
ple, getting an initial set of constraints by solv- 
ing a max-flow problem, computing finite differences 
with steps larger than one (selected adaptively), and 
rounding up the solution of an LP instead of solving 
the exact IP. The methodology replaces the unknown 
expectations in the constraints of the original problem 
by sample averages. It is a heuristic: there is no guar- 
antee that the optimal solution to the sample problem 
is optimal or even feasible for the original problem, 
and no guarantee that the returned solution is optimal 
or feasible for the sample problem. But ''good" (not 
too far from optimal) solutions are returned most of 
the time. These solutions can (and should) be refined 
at the end of the algorithm via local search heuristics. 
Further work is needed in that direction. It is also a 
good idea to run the algorithm more than once with 
different streams of random numbers and perhaps 
slightly different parameters, and pick up the best of 
the solutions thus obtained. This provides a viable 
approach to obtain reasonable solutions for important 
practical problems for which no methodology that we 
know can return guaranteed optimal solutions. 

6. Electronic Companion 

An electronic companion to this paper is available as 
part of the online version that can be found at http:// 
mansci.journal.informs.org/. 
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Abstract 

We examine and compare simulation-based algorithms for solving the agent scheduling problem in a 
multiskill call center. This problem consists in minimizing the total costs of agents under constraints 
on the expected service level per call type, per period, and aggregated. We propose a solution 
approach that combines simulation with integer or linear programming, with cut generation. In our 
numerical experiments with realistic problem instances, this approach performs better than all other 
methods proposed previously for this problem. We also show that the two-step approach, which is 
the standard method for solving this problem, sometimes yield solutions that are highly suboptimal 
and inferior to those obtained by our proposed method. 
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1 Introduction 



The telephone call centeT industry employs millions of people around the world and is fast growing. 
In the United States, for example, customer service representatives held 2.1 million jobs in 2004, 
and employment in this job category is expected to increase faster than average at least through 20 14 
(Bureau of Labor Statistics 2007). A few percent saving in workforce salaries easily means several 
million dollars. 

Call centers often handle several types of calls distinguished by the required skills for delivering 
service. Training all agents to handle all call types is not cost-effective. Each agent has a selected 
number of skills and the agents are distinguished by the set of call types they can handle (also called 
their skill set). When such skill constraints exist, we speak of a multiskill call center. Skill-based 
routing (SBR), or simply routing, refers to the rules that control the call-to-agent and agent-to-call 
assignments. Most modern call centers perform skill-based routing (Koole and Mandelbaum 2002, 
Gans et al. 2003). 

In a typical call center, inbound calls arrive at random according to some complicated stochastic 
processes, call durations are also random, waiting calls may abandon after a random patience time, 
some agents may fail to show up to work for any reason, and so on. Based on forecasts of call 
volumes, call center managers must decide (among other things) how many agents of each type (i.e., 
skill set) to have in the center at each time of the day, must construct working schedules for the 
available agents, and must decide on the call routing rules. These decisions are made under a high 
level of uncertainty. The goal is typically to provide the required quality of service at minimal cost. 

The most common measure of quality of service is the service level (SL) ? defined as the long- 
term fraction of calls whose time in queue is no larger than a given threshold. Frequently, multiple 
measures of SL are of interest: for a given time period of the day, for a given call type, for a given 
combination of call type and period, aggregated over the whole day and all call types, and so on. For 
certain call centers that provide public services, SL constraints are imposed by external authorities, 
and violations may result in stiff penalties (CRTC 2000). 

In this paper, we assume that we have a detailed stochastic model of the dynamics of the call 
center for one day of operation. This model specifies the stochastic processes for the call arrivals 
(these processes are usually non-stationary and doubly stochastic), the distributions of service times 
and patience times for calls, the call routing rules, the periods of unavailability of agents between 
calls (e.g., to fill out forms, or to go to the restroom, etc.), and so forth. We formulate a stochastic 
optimization problem where the objective is to minimize the total cost of agents, under various SL 
constraints. This could be used in long-term planning, to decide how many agents to hire and for 
what skills to train them, or for short-term planning, to decide which agents to call for work on a 
given day and what would be their work schedule. The problem is difficult because for any given 
fixed staffing of agents (the staffing determines how many agents of each type are available in each 
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time period), no reliable formulas or quick numerical algorithms are available to estimate the SL; 
it can be estimated accurately only by long (stochastic) simulations. Scheduling problems are in 
general NP-hard, even in deterministic settings where each solution can be evaluated quickly and 
exactly. When this evaluation requires costly and noisy simulations, as is the case here, solving the 
problem exactly is even more difficult and we must settle with methods that are partly heuristic. 

Staffing in the single-skill case (i.e., single call type and single agent type) has received much 
attention in the call center literature. Typically, the workload varies considerably during the day 
(Gans et al. 2003 ? Avramidis et al. 2004, Broun et al. 2005), and the planned staffing can change 
only at a few discrete points in time (e.g., at the half hours). It is common to divide the day into 
several periods during which the staffing is held constant and the arrival rate does not vary much. 
If the system can be assumed to reach steady-state quickly (relative to the length of the periods), 
then steady-state queueing models are likely to provide a reasonably good staffing recommendation 
for each period. For instance, in the presence of abandonments, one can use an Erlang-A formula 
to determine the minimal number of agents for the required SL in each period (Gans et al. 2003). 
When that number is large, it is often approximated by the square root safety staffing formula, based 
on the Halfin-Whitt heavy-traffic regime, and which says roughly that the capacity of the system 
should be equal to the workload plus some safety staffing which is proportional to the square root of 
the workload (Halfin and Whitt 1981, Gans et al. 2003). This commonly used heuristic, known as 
the stationary independent period by period (SIPP) approach, often fails to meet target SL because 
it neglects the non-stationarity (Green et al. 2003). Non-stationary versions of these approximations 
have also been developed, still for the single-skill case (Jennings et al. 1996, Green et al. 2003). 

Scheduling problems are often solved in two separate steps (Mehrotra 1 997): After an appropriate 
staffing has been determined for each period in the first step, a minimum-cost set of shifts that 
covers this staffing requirement can be computed in the second step by solving a linear integer 
program. However, the constraints on admissible working shifts often force the second step solution 
to overstaff in some of the periods. This drawback of the two-step approach has been pointed out 
by several authors, who also proposed alternatives (Keith 1979, Thompson 1997, Henderson and 
Mason 1998, Ingolfsson et al. 2003, Atlason et al. 2004). For example, the SL constraint is often 
only for the time-aggregated (average) SL over the entire day; in that case, one may often obtain 
a lower-cost scheduling solution by reducing the minimal staffing in one period and increasing it 
in another period Atlason et al. (2004) developed a simulation-based methodology to optimize 
agents' scheduling in the presence of uncertainty and general SL constraints, based on simulation 
and cutting-plane ideas. Linear inequalities (cuts) are added to an integer program until its optimal 
solution satisfies the required SL constraints. The SL and the cuts are estimated by simulation. 

In the multiskill case, the staffing and scheduling problems are more challenging, because the 
workload can be covered by several possible combinations of skill sets, and the routing rules also 
have a strong impact on the performance. Staffing a single period in steady -state is already difficult; 



3 



the Erlang formulas and their approximations (for the SL) no longer apply.. Simulation seems 
to be the only reliable tool to estimate the SL. Ce?ik and L/Ecuyer (2007) adapt the simulation- 
based methodology of Atlason et al. (2004) to the optimal staffing of a multiskill call center for a 
single period. They point out difficulties that arise with this methodology and develop heuristics 
to handle them. Avramidis et al. (2006) solve the same problem by using neighborhood search 
methods combined with an analytical approximation of SLs, with local improvement via simulation 
at the end. Pot et al. (2007) impose a constraint only on the aggregate SL (across all call types); they 
solve Lagrangean relaxations using search methods and analytical approximations. 

Some authors have studied the special case where there are only two call types, and some have 
developed queueing approximations for the case of two call types, via Markov chains and under 
simplifying assumptions; see Stolletz and Helber (2004) for example. But here we are thinking of 
20 to 50 call types or more, which is common in modern call centers, and for which computation 
via these types of Markov chain models is clearly impractical. 

For the multiskill scheduling problem, Bhulai et al. (2007) propose a two-step approach in which 
the first step determines a staffing of each agent type for each period, and the second step computes 
a schedule by solving an IP in which this staffing is the right-hand side in key constraints. A key 
feature of the IP model is that the staff-coverage constraints allow downgrading an agent into any 
alternative agent type with smaller skill set, separately for each period. Bhulai et al. (2007) recognize 
that their two-step approach is generally suboptimal and they illustrate this by examples. 

In this paper, we propose a simulation-based algorithm for solving the multiskill scheduling 
problem, and compare it to the approach of Bhulai et al. (2007). This algorithm extends the method 
of Ce?ik and L'Ecuyer (2007), which solves a single-period staffing problem. In contrast to the two- 
step approach, our method optimizes the staffing and the scheduling simultaneously. Our numerical 
experiments show that our algorithm provides approximate solutions to large-scale realistic problem 
instances in reasonable time (a few hours). These solutions are typically better, sometimes by a 
large margin (depending on the problem), than the best solutions from the two-step approach. We 
are aware of no competitive faster method 

The remainder of this paper is organized as follows. In section 2, we formally define the problem 
at hand and provide a mathematical programming formulation. The new algorithm is described in 
3. We report computational results on several test instances in section 4. The conclusion follows. 
A preliminary version of this paper was presented at the 2007 Industrial Simulation Conference 
(Avramidis et al. 2007a). 

2 Model Formulation 

We now provide definitions of the multiskill staffing and scheduling problems. We assume that we 
have a stochastic model of the call center, under which the mathematical expectations used below 
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are well defined, and that we can simulate the dynamics of the center under this model. Our problem 
formulations here do not depend on the details of this model. 

There are K call types, labeled from 1 to /C, and / agent types, labeled from 1 to /. Agent type / 
has the skill set S, -C { 1, . . . , K}. The day is divided into P periods of given length, labeled from 1 to 
P. The staffing vector is y = (} ? J . I > • • ■ O'l .y, . . - ,37, 1 , • . • ,)'j.p) 1 where y^ p is the number of agents of 
type / available in period p. Given y, the service level (SL) in period p for type-* calls is defined as 

g kmP (y) = E{S g ^ p ]/E\S k , p +A kmP ], 

where S k , p is the number of type-/: calls that arrive in period p, S^ p is the number of those calls 
that get served after waiting at most x k%p (a constant called the acceptable waiting time), and A k%p is 
the number of type-* calls that abandon in period p after waiting at least % k p . Aggregate SLs, per 
call type, per period, and globally, are defined analogously. Given acceptable waiting times x p < x ki 
and T, the aggregate SLs are denoted by g p (y), g k {y) and g(y) for period /?, call type k, and overall, 
respectively. 

A shift is a time pattern that specifies the periods in which an agent is available to handle calls. In 
practice, it is characterized by its start period (the period in which the agent starts working), break 
periods (the periods when the agent stops working), and end period (the period when the agent 
finishes his/her workday). In general, agents have several breaks of different duration; for instance, 
morning and afternoon coffee breaks, as well as a longer lunch break. 

Let { 1 , . . . , Q} be the set of all admissible shifts. To simplify the exposition, we assume that this 
set is the same for all agent types; this assumption could easily be relaxed if needed, by introducing 
specific shift sets for each agent type. The admissible shifts are specified via a P x Q matrix Ao 
whose element (p,q) is a p # = 1 if an agent with shift q works in period p, and 0 otherwise. A 
vector x = (*!,!, .. . ,*i ? g, . . . , • • - ,*7,g)\ where x^ q is the number of agents of type i working 
shift q, is a schedule. The cost vector is c = (c\ j , . . . , c\ ? g, . . . , c/ ? i , . . . , C/,p)\ where c^ is the cost 
of an agent of type i with shift q. To any given shift vector x, there corresponds the staffing vector 
y = Ax, where A is a block-diagonal matrix with / identical blocks A 0 , if we assume that each agent 
of type / works as a type-/ agent for his/her entire shift. 

However, following Bhulai et al. (2007), we also allow an agent of type / to be downgraded to 
an agent with smaller skill set, i.e., of type i p where S ip C 5„ in any time period p of his/her shift. 
Define 5?+ = {j : Sj D Si A ?m : Sj DS m D Si} is thus the set of agent types whose skill set is a 
minimum strict superset of the skill set of agent type /) and S?r = {j : Sj c Si A fim : Sj cS m C Si} 
is thus the set of agent types whose skill set is a maximum strict subset of skill set of agent 
type j). To illustrate, consider a call centre with K = 3 call types, 7 = 4 agent types, and skill sets 
S\ = {1}, S 2 = {2} (specialist agents), S 3 = {2,3}, and = {1,2,3} (generalist agent); then we 
have, among others, = = ®* = {3}, and = { 1 , 3}. For each / and j € and each 
period p, we define the skill transfer variable z/ ; y jP , which represents the number of type-/ agents that 
are downgraded to type j during period p. Note that by performing multiple skill transfers during 
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a period, an agent of type / may end up being downgraded to any type whose skill set included in 
Si (in the previous example, a type 4 agent could be downgraded to type 3 and then to type 2, even 
though there are no 24,2,/? variables). 

A schedule x = (x { ti , . . . ,x\ & . . . ,x JA , . . . .x^q) 1 is said to cover the staffing y = {y u j , . . . - . . , 
J'/.i 1 • - • 0 : J.pT if for / = 1 , . . . , / and p = 1 , . . . , P % there are nonnegativc integers zjj %p for j e 5^ and 
z/,y, p for y € ^J", such that 

These inequalities can be written in matrix form as Ax + Bz > y, where z is a column vector whose 
elements are the z- % j %p variables and B is a matrix whose entries are in the set {—1,0, 1}. With this 
notation, the scheduling problem can be formulated as 

(P0) : [Scheduling problem] 

min c t x = S^ 1 lJ =1 c / .^ 
s.t 

Ax + Bz > y 

gk.piy) > 4,p to 1 < and 1 <p <P 
gp(y)>l P forl<p<P 

gk(y)>k fori<k<K 
g(y)>i 

x>0, z>0, y>0 and integer 



where l^ py l p , Ik and / are given constants. 

In practice, a given agent often works more efficiently (faster) when handling a smaller number 
of call types (i.e., if his/her skill set is artificially reduced). The possibility of downgrading agents to 
a smaller skill set for some periods can sometimes be exploited to take advantage of this increased 
efficiency. In case where the agent's speed for a given call type (in the model) does not depend on 
his/her skill set, one might think intuitively that downgrading cannot help, because it only limits the 
flexibility of the routing. This would be true if we had an optimal dynamic routing of calls. But 
in practice, an optimal dynamic routing is too complicated to compute and simpler routing rules 
are used instead. These simple rules are often static. Then, downgrading may sometimes help by 
effectively changing the routing rules. Clearly, the presence of skill transfer variables in (P0) cannot 
increase the optimal cost, it can only reduce it. 

Suppose we consider a single period, say period p, and we replace gk, P (y) and g p (y) by approx- 
imations that depend on the staffing of period p only, say g k . p (y\, p , . . - ,j>/./>) and g p (y\#, . . . ,yy, p ), 
respectively. If all system parameters are assumed constant over period p, then natural approxima- 
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tions arc obtained by assuming that the system is in steady-state over this period. The single-period 
multiskill staffing problems can then be written as 



(PI): [Staffing problem] 
s.t. 

&0 f i» — fori <k<K 

g{yu-.-,yi)>i 

yi > 0 and integer for all / 



where c, is the cost of agent type / (for a single period), and the period index was dropped throughout. 
Simulation-based solution methods for this problem are proposed in Ce?ik and L'Ecuyer (2007) and 
Avramidis et al. (2006). Pot et al. (2007) address a restricted version of it, with a single constraint 
on the aggregate SL over the period (i.e., they assume 4 = 0 for all k). 

In the approach of Bhulai et al. (2007), the first step is to determine an appropriate staffing, 
y = « • • • oh,/», - • • - - • , firf. For this, they look at each period p in isolation and solve a 
version of (PI) with a single constraint on the aggregate SL; this gives y x , p , . . . jj, p for each p. In 
their second step, they find a schedule that covers this staffing by solving: 



(P2): 


[Two-stage approach] 


min 




s.t. 






Ax + Bz >y 




x > 0, z > 0 and integer 



The presence of skill-transfer variables generally reduces the optimal cost in (P2) by adding 
flexibility, compared with the case where no downgrading is allowed. However, there sometimes 
remains a significant gap between the optimal solution of (P0) and the best solution found for the 
same problem by the two-step approach. The following simplified example illustrates this. 

Example 1 Let K = / = P = 3, and Q = 1 . The single type of shift covers the three periods. The 
skill sets are S\ = { 1 , 2}, S 2 = { 1 , 3}, and 5 3 = {2, 3}. All agents have the same shift and the same 
cost. Suppose that the total arrival process is stationary Poisson with mean 100. This incoming 
load is equally distributed between call types {1,2} in period 1, {1,3} in period 2, {2,3} in period 
3. Any agent can be downgraded to a specialist that can handle a single call type (that belongs to 
his skill set), in any period. In the presence of such specialists, an incoming call goes first to its 
corresponding specialist if there is one available, otherwise it goes to a generalist that can handle 
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another call type as well. When the agent becomes available, he serves the call that has waited the 
longest among those in the queue (if any). The service times are exponential with mean 1, there 
are no abandonments, and the SL constraints specify that 80% of all calls must be served within 20 
seconds, in each time period, on average over an infinite number of days. 

If we assume that the system operates in steady-state in period 1, then the optimal staffing for that 
period is 104 agents of type 1. Since all agents can serve all calls, we have in this case an M/M/s 
queue with s = 104, and the global SL is 83.4%, as can be computed by the Erlang-C formula. By 
symmetry, the optimal staffing solutions for the other periods are obviously the same: 104 agents of 
type 2 in period 2 and 104 agents of type 3 in period 3. Then, the two-step approach gives a solution 
to (P2) with 104 agents of each type, for a total of 312 agents. 

Solving (P0) directly instead (e.g., using the simulation-based algorithm described in the next 
section), assuming again (as an approximation) that the system is in steady-state in each of the three 
periods, we find a feasible solution with 35 agents of type 1, 35 agents of type 2, and 34 agents of 
type 3, for a total of 104 agents. With this solution, during period 1, the agents of types 2 and 3 are 
downgraded to specialists who handle only call types 1 and 2, respectively, and the agents of type 1 
act as generalists. A similar arrangement applies to the other periods, mutatis mutandis. Note that 
this solution of (P0) remains valid even if we remove the skill transfer variables from the formulation 
of (P0), because the sets Si~ and are all empty, if we assume that the routing rules do not change; 
i.e., if calls are always routed first to agents that can handle only this call type among the calls that 
can arrive during the current period. 

Suppose now that we add the additional skill sets 5 4 = {1}, S$ = {2}, S 6 = {3}, and that these 
new specialists cost 6 each, whereas the agents with two skills cost 7. In this case it becomes 
attractive to use specialists to handle a large fraction of the load, because they are less expensive, 
and to keep a few generalists in each period to obtain a "resource sharing" effect. It turns out that 
an optimal staffing solution for period 1 is 2 generalists (type 1) and 52 specialists of each of the 
types 4 and 5. An analogous solution holds for each period. With these numbers, if downgrading 
is not possible, the two-step approach gives a solution with 6 generalists (2 of each type) and 156 
specialists (52 of each type), for a total cost of 978. If downgrading is allowed, then the two-step 
approach finds the following much better solution: 2 agents of type 1 and 52 of each of the types 2 
and 3, for a total cost of 742. The skill transfer works in this way. In period 1: 52 agents of type 
2 are downgraded to specialists of type 4 and 52 of type 3 to specialists of type 5. In period 2: 2 
agents of type 1 are downgraded to agents of type 5, 52 of type 2 to type 6 and 50 of type 3 to type 
5. In period 3: 2 agents of type 1 are downgraded to agents of type 4, 50 of type 2 to type 4 and 52 
of type 3 to type 6. If we solve (P0) directly with these additional skill sets, we get the same solution 
as without them; i.e., 104 agents with two skills each, for a total cost of 728. This is again better 
than with the two-step approach, but the gap is much smaller than what we had with only three skill 
sets. 
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Example 2 In the previous example, if all the load was from a single call type, there would be a 
single agent type and the two-step approach would provide exactly the same solution as the optimal 
solution of (PO). The example illustrates a suboptimality gap due to a variation in the type of load. 

Another potential source of suboptimality (this one can occur even in the case of a single call 
type) is the time variation of the total load from period to period. If there is only a global SL 
constraint over the entire day, then the optimal solution may allow a lower SL during one (or more) 
peak period(s) and recover an acceptable global SL by catching up in the other periods. To account 
for this, Bhulai et al. (2007), Section 5.4, propose a heuristic based on the solution obtained by their 
basic two-step approach. Although this appears to work well in their examples, the effectiveness of 
this heuristic for general problems is not clear. 

Yet another type of limitation that can significantly increase the total cost is the restriction on 
the set of available shifts.. Suppose for example that there is a single call type, that the day has 
10 periods, and that all shifts must cover 8 periods, with 7 periods of work and a single period of 
lunch break after 3 or 4 periods of work. Thus a shift can start in period 1, 2, or 3, and there are 
six shift types in total. Suppose we need 100 agents available in each period. For this we clearly 
need 200 agents, each one working for 7 periods, for a total of 1400 agent-periods. If there were no 
constraints on the duration and shape of shifts, on the other hand, then 1000 agent-periods would 
suffice. 

3 Optimization by Simulation and Cutting Planes 

We now describe the proposed simulation-based optimization algorithm. The general idea is to 
replace the problem (P0) by a sample version of it, (SP0„), and then replace the nonlinear SL con- 
straints by a small set of linear constraints, in a way that the optimal solution of the resulting relaxed 
sample problem is close to that of (P0). The relaxed sample problem is solved by linear or integer 
programming. 

We first describe how the relaxation works when applied directly to (PO); its works the same way 
when applied to the sample problem. Consider a version of (P0) in which the SL constraints have 
been replaced by a small set of linear constraints that do not cut out the optimal solution. Let y be 
the optimal solution of this (current) relaxed problem. If y satisfies all SL constraints of (P0) ? then 
it is an optimal solution of (P0) and we are done. Otherwise, take a violated constraint of (P0), say 
g(y) < /, suppose that g is (jointly) concave in y for y > y, and that q is a subgradient of g at y. Then 

g(y)<g{y)+$(y-y) 

for all y > y. We want g(y) > /, so we must have 

/<g(y)<g(50+q t (y-y) ) 
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i.e., 

?y>?y+/-g(y). (2) 

Adding this linear cut inequality to the constraints removes y from the current set of feasible solu- 
tions of the relaxed problem without removing any feasible solution of (PO). On the other hand, in 
case q is not really a subgradient (which may happens in practice), then we may cut out feasible 
solutions of (PO), including the optimal one. We will return to this. 

Since we cannot evaluate the functions g exactly, we replace them by a sample average over n 
independent days, obtained by simulation. Let 0) represent the sequence of independent uniform 
random numbers that drives the simulation for those n days. When simulating the call center for 
different values of y, we assume that the same uniform random numbers are used for the same 
purpose for all values of y, for each day. That is, we use the same (O for all y. Proper synchronization 
of these common random numbers is implemented by using a random number package with multiple 
streams and substreams (Law and Kelton 2000, L'Ecuyer et al. 2002, L'Ecuyer 2004). 

The empirical SL over these n simulated days is a function of the staffing y and of <o. We denote it 
by gn&piVi ®) for cal1 t yP e k in Period p\ g n?P (y, <o) aggregated over period p; g n ,k(y, <o) aggregated 
for call type k; andg„(y,c») aggregated overall. For a fixed O), these are all deterministic functions 
of y. Instead of solving directly (P0), we solve its sample-average approximation (SP0„) obtained 
by replacing the functions g in (P0) by their sample counterparts g (here, g stands for any of the 
empirical SL functions, and similarly for g). 

We know that g„jt,p(y) converges to gk, P (y) with probability 1 for each (k,p) and each y when 
n — > oo. in this sense, (SP0„) converges to (P0) when n — > ©o Suppose that we eliminate a priori all 
but a finite number of solutions for (P0). This can easily be achieved by eliminating all solutions 
for which the total number of agents is unreasonably large. Let ^* be the set of optimal solutions 
of (P0) and suppose that no SL constraint is satisfied exactly for these solutions. Let be the 
set of optimal solutions of (SP0„). Then, the following theorem implies that for n large enough, an 
optimal solution to the sample problem is also optimal for the original problem. It can be proved 
by a direct adaptation of the results of Vogel (1994) and Atlason et al. (2004); see also Ce?ik and 
L'Ecuyer (2007). 

Theorem 1 With probability /, there is an integer Afo < °° such that for all w > No, = More- 
over, under the mild assumption that the service-level estimators satisfy a standard laige-deviation 
principle (see Assumption I in Ce?ik and L 'Ecuyer (2007)) , there are positive real numbers a and 
/3 such that for all n, 

P[%* = ^}>l-ae~^ n . 

We solve (SP0„) by the cutting plane method described earlier, with the functions g replaced 
by their empirical counterparts. The major practical difficulty is to obtain the subgradients q. In 
fact, the functions g in the empirical problem (computed by simulation) are not necessarily concave 
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for finite //. even in the areas where the functions g of (PO) are concave. To obtain a (tentative) 
subgradient q of a function g at y. we use forward finite differences as follows. For j = 1 , . . . JP f 
we choose an integer dj > 0, we compute the function g at y and at y + djCj for j = 1 . . . . . //>, where 
e 7 is the yth unit vector, and we define q as the /P-dimensional vector whose /th component is 

qj = \#y + dj*j)-i(y)]/dj. (3) 

In our experiments, we used the same heuristic as in Ce?ik and L'Ecuyer (2007) to select the d/s: We 
took dj = 3 when the SL corresponding to the considered cut was less than 0.5, dj = 2 when it was 
between 0.5 and 0.65, and dj = 1 when it was greater than 0.65. When we need a subgradient for a 
period-specific empirical SL (g p org k%p \ the finite difference is formed only for those components of 
y corresponding to the given period; the other elements of q are set to zero. This heuristic introduces 
inaccuracies, because g p and g* ;P depend in general on the staffing of all periods up to p or even 
p + 1, but it reduces the work significantly. 

Computing q via (3) requires IP+ 1 simulations of n days each. This is by far the most time- 
consuming part of the algorithm. Even for medium-size problems, these simulations can easily 
require an excessive amount of time. For this reason, we use yet another important short-cut: We 
generally use a smaller value of n for estimating the subgradients than for checking feasibility. (The 
latter requires a single w-day simulation experiment.) That is, we compute eachg(y + djej) in (3) 
using no < n days of simulation, instead of n days. In most of our experiments (including those 
reported in this paper), we have used «o « if / 10. 

With all these approximations and the simulation noise, we recognize that the vector q thus 
obtained is only a heuristic guess for a subgradient. It may fail to be a subgradient. In that case 
the cut (2) may remove feasible staffing solutions including the optimal one, and this may lead 
our algorithm to a suboptimal schedule; Atlason et al. (2004) and Ce?ik and L'Ecuyer (2007) give 
examples of this. For this reason, it is a good idea to run the algorithm more than once with different 
streams of random numbers and/or slightly different parameters, and retain the best solution found. 

At each step of the algorithm, after adding new linear cuts, we solve a relaxation of (SP0„) 
in which the SL constraints have been replaced by a set of linear constraints. This is an integer 
programming (IP) problem.. But when the number of integer variables is large, we just solve it 
as a linear program (LP) instead, because solving the EP becomes too slow. To recover an integer 
solution, we select a threshold S between 0 and 1; then we round up (to the next integer) the real 
numbers whose fractional part is larger than S and we truncate (round down) the other ones. These 
two versions of the CP algorithm are denoted CP-IP and CP-LP 

When we add new cuts, we give priority to the cuts associated with the global SL constraints, 
followed by aggregate ones specific to a call type, followed by aggregate ones specific to a period, 
followed by the remaining ones. This is motivated by the intuitive observation that the more aggre- 
gation we have, the smoother is the empirical SL function, because it involves a larger number of 
calls. So its gradient is less likely to oscillate and the vector q defined earlier is more likely to be a 
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subgradient. Moreover, in the presence of abandonments, the SL functions tend to be non-concave 
in the areas where the SL is very small, and very small SL values tend to occur less often for the ag- 
gregated measures than for the more detailed ones that were averaged. Adding cuts that strengthen 
the aggregate SL often helps to increase the small SL values associated with specific periods and 
call types. 

After adding enough linear cuts, we eventually end up with a feasible solution for (SP0„). This 
solution may be infeasible for (PO) (because of random noise, especially if n is small) or may be 
feasible but suboptimal for (PO) (because one of the cuts may have removed the optimal solution 
of (PO) from the feasible set of (SPOJ). To try improving our solution to (SP0„), we perform a 
local search around it. In the CP-LP version, before launching this local search, the solution must 
be rounded to integers. This is done using a threshold S as explained earlier. To determine this 
threshold, we perform a binary search over the interval [0, 1], up to an accuracy of 0.01 . to find the 
largest value of S that yields a feasible integer solution for (SP0„). 

The local search proceeds by iteratively considering longer simulations to check the feasibility 
of the solutions that it examines. The number of days used in these simulations, wi, starts from a 
value m (smaller than n) specified as an input parameter and increases at each iteration by 50% of 
this value. Each iteration of the local search tries to solve SP0„, 5 in three phases... In the first phase, 
the current solution is checked again for feasibility with the new value of n\ and agents are added 
at minimum cost until feasibility has been restored, if required. In the second phase, we attempt 
to reduce the cost of the solution by removing one shift at a time, until none of the possibilities is 
feasible. We further attempt to reduce the cost in the third phase by iteratively considering switch 
moves in which we try to replace an agent/shift pair by another one with smaller cost; the candidates 
for the switch moves are drawn at random, at each step, and the phase terminates when a maximum 
number of consecutive moves without improvement is reached. After the third phase, the current 
solution is tested for feasibility in a simulation of duration /i 3 = max(w,500) days. If it is feasible 
or if a time limit has been reached, the local search terminates, otherwise n\ is increased and a new 
iteration is performed. Thus, at the end of the local search procedure, we have a feasible solution 
for either SP0„, or SP0„ 3 . The reason for using shorter, but increasingly long, simulations in the 
local search is the need to find some balance between limiting the time required to evaluate a large 
number of candidate solutions and ensuring the feasibility of the solutions considered (it is pointless 
to spend time examining a large number of solutions if they all turn up to be infeasible). 

If we start the cutting plane algorithm with a full relaxation of (SP0„) (no constraint at all), the 
optimal solution of this relaxation is y = 0. The functions g are not concave at 0, and we cannot 
get subgradients at that point, so we cannot start the algorithm from there. As a heuristic to quickly 
remove this area where the staffing is too small and the SL is non-concave ? we restrict the set of 
admissible solutions a priori by imposing (extra) initial constraints. To do that, we impose that for 
each period /?, the skill supply of the available agents covers at least a* times the total load for each 
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call type A* (defined as the arrival rate of that call type divided by its sen ice rate), where each a k 
is a constant, usually close to 1. Finding the corresponding linear constraints is easily achieved by 
solving a max flow problem in a graph. See Ce?ik and L'Ecuyer (2007) for the details. 

4 Computational Results 

In order to assess the performance of the proposed algorithm, as well as the impact of flexibility on 
solutions, a number of problem instances were solved with the proposed algorithm and the two-step 
method. These instances were constructed so as to mimic the operations of real call centers. Their 
general setting is characterized as follows, unless stated otherwise. 

The call center opens at 8:00 AM and closes at 5:00 PM; the working day is divided into P = 36 
15-minute periods. Shifts vaiy in length between 6.5 hours (26 periods) and 9 hours (36 periods) and 
include a 30-minute lunch break near the middle and two 15-minute coffee breaks (one pre-lunch 
and one post-lunch). Overall, there are 285 possible shifts (see Table 1). 

Call arrivals are assumed to obey a stationary Poisson process over each period, for each call 
type, and independent across call types. The profile of the arrival rates in the different periods are 
inspired from observations in real-life call centers at Bell Canada (Avramidis et al. 2004). They are 
plotted separately for each instance. All service times are exponential with service rate \i = 8 calls 
per hour. Patience times have a mixture distribution: the patience is 0 with probability 0.001, and 
with probability 0.999, it is exponential with rate 0.1 per minute. The routing policy is an agents' 
preference-based router (Buist and L'Ecuyer 2005). 

For most instances, we only consider aggregate service level constraints for each period. These 
require that at least 80% of all the calls received during the period be answered within 20 seconds 
(i.e., we have x p = 20 seconds and l p = 0.8 for each p). The satisfaction of these constraints implies 
that the global constraint with x = 20 seconds and / = 0.8 is automatically satisfied, but we still 
require this explicitely, because this constraint plays a key role in the cutting-plane algorithm. In 
some cases, we also impose disaggregate SL constraints for each (call type, period) combination 
(k,p) with = 20 seconds and = 0.5 for all k and /?. Note that this in turn implies the 
satisfaction of aggregate SL constraints for each call type k with x k = 20 seconds and l k = 0.5. 

The formula used to compute agents' costs accounts for both the number of skills in the agent's 
skill set and the length of the shift being worked: 

c iq = ( 1 + (ij, - l)g)/^/30 for all / and q , (4) 

where l q is the length (in periods) of shift 30 is the number of periods in a "standard" 7.5-hour 
shift, r\i is the cardinality of S,, and q is an instance-specific parameter that represents the cost 
associated with each agent skill. 

We first compare the two solution methods described (i.e., TS and CP) on three instances that 
correspond to a small (section 4.1), a medium-sized (section 4.2) and a larger call center (section 
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Type length shift start 


hr^ak 1 •start 
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/ .JU 


SJ*ftn $?'in o-nn o-^n 
o.UU, o.JU, V.UU, V.jU 


y. JIM 1 .JU 


12:00, 12:30, 13:00 


14:00-15:30 


2 


7:45 


9:15 


10:45-1 1:15 


12 00 12 30 1100 




3 


8:00 


9:00 


10:30-11:00 


12:00, 12:30, 13:00 


14:00-15:30 


4 


8:15 


8:45 


10:15-10:45 


12:00, 12:30, 13:00 


14:00-15:30 


5 


8:30 


8:30 


10:00-10:30 


12:00, 12:30, 13:00 


14:30-16:00 


6 


9:00 


8:00 


11:00-12:00 


13:00, 13:30, 14:00 


14:45-16:15 


7 


6:30 


10:00 


11:30-12:00 


13:00, 13:30, 14:00 


14:15-15:45 



Table 1: Description of the 285 shifts for our examples 



4.3). For the medium-size center, two variants are considered: M u in which only aggregate SL 
constraints considered, and M 2 with aggregate and disaggregate SL constraints. For the larger center, 
we also examine the impact of having a longer working day. 

Both TS and CP use Cplex 9.0 to solve the optimization problems. To allow a fair comparison 
of the methods, we allocate the same CPU time "budget" to each. Considering the nature of the 
algorithms, this cannot be done by simply stopping them when this time limit is reached. Instead, 
we must carefully adjust, by trial and error, the number of simulated days n, which is a key parameter 
of both methods, to obtain running times close to the target budget. It is clear that one would not 
use such a procedure in a practical context, but this is necessary for the comparative study. For 
each instance, we consider several different budgets, since we expect that a higher value of n will 
produce more accurate and more stable results. Furthermore, in each case, r replications of each 
method/budget combination are performed to account for the random elements in both methods. 

In the first phase of TS, to simulate each individual period of the call center and evaluate the 
results of the simulations, we use the batch means method (Law and Kelton 2000). Each batch 
is constituted by a minimum number of 30 simulation time units and statistical observations are 
collected on a minimum of 50 batches, using 2 warmup batches before starting to collect statistics. 

Final solutions obtained by the two methods were simulated for «„ = 50, 000 days as an additional 
(much more stringent) feasibility test, and each solution was declared feasible or not according to 
the result of this test, i.e., according to the feasibility of (SP0„.). 

For each instance (or variant), results are summarized in a table with the following column head- 
ings: case, an index assigned for a specific CPU time budget; n, the number of simulated days for 
checking feasibility when adding cutting planes and for the local search at the end of the algorithm; 
»2, the starting number of days in local search simulations; CPUavg, the average CPU time per repli- 
cation; Min cost and Med cost, which are respectively the minimum and median costs of all solutions 
(feasible or not) obtained by this method over the r replications; P*, the percentage of replications 
that returned a feasible solution for (SP0„. ); and P{, the percentage that returned a feasible solution 
with cost within 1% of the best known feasible solution (the lowest-cost feasible solution for (SP0„.) 
generated by either algorithm, over all replications and CPU time budgets, in all experiments that 
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we have done, including those described in Section 4.4). We also report the maximum relative viola- 
tion gap (in percent) observed in a SL constraint for each type of constraints; G pcriod and G ca ii. period 
refer respectively to violations of SL constraints for periods and for individual (call type, period) 
combinations. 

4.1 A small call center 

This instance has K = 2 call types and / = 2 agent types, with 5, = {1 } and S 2 = { 1,2}. Agent costs 
are computed by setting the parameter g equals to 0.2 in formula 4. Arrival rates for the two call 
types are plotted in Figure 1. All SL constraints are enforced in this instance. Four different CPU 
time budgets were considered: 3, 15, 30 and 60 minutes... Results, based on /• = 32 replications, are 
displayed in Table 2. 
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Figure 1: Small center: arrival rates 
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CP-IP 


1600 
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1774 
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87 
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0.15 
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2800 




1753 
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0.79 
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CP-IP 


2000 


1000 


3694 


35.03 


35.17 
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0 
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TS 


6000 




3453 


35.67 


35.67 
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0 


0.79 
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Table 2: Small center: results obtained with CP-IP and TS for different CPU time budgets 



Several observations can be made from Table 2. First, CP-IP is almost always able to find feasible 
solutions to the problem and most of them are very good The only case where solutions are more 
expensive than those obtained with TS, and these are infeasible, is when n = 120, a rather small 
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value. On the other hand, the results for the three larger values of/? are quite similar and setting n 
to 1500 is probably sufficient for this small center It is also interesting, and surprising, to find out 
that all runs of TS failed to find a feasible solution, even though constraint violations were always 
inferior to 1%. The solutions produced by TS are also more expensive than the ones obtained with 
CP for large enough n. Such an outcome in not surprising considering the inherent shortcomings 
of the method. A closer examination of the distribution of the cost of solutions reveals that, except 
for CP-IP with n = 120, these costs vary very little. With TS, all runs with the same computing 
budget return in fact the same cost value. In practice, a manager might be willing to use almost- 
feasible solutions, considering the fact that the center will always experience stochastic variation in 
the arrival process and the SL in any case. For this reason, it is probably useful to report slightly 
infeasible solutions in general, and not only the feasible ones. 

The best scheduling solutions obtained by CP and TS are described in Table 3. In this table, shift 
types correspond to the length of the shifts as indicated in Table 1. Both methods return solutions 
with the same number of agents (31), most of which are specialists (type 1). Further analysis of 
these solutions reveals that they differ only slightly in terms of the duration of the shifts scheduled, 
but CP uses more specialists than TS (24 vs 21) and thus gives a cheaper solution. 
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Table 3: Small center: scheduling solutions 



Service levels for the best solution obtained are plotted in Figure 2. We see (1) a wide variation of 
the SL throughout the day and (2) that calls of type 1 have much better SL than those of type 2. This 
imbalance can be explained by the fact that the type 1 calls can be answered by less expensive spe- 
cialists, while type 2 calls must be handled by generalists. This observation highlights the fact that to 
ensure a fair treatment of all call types in a real-life setting, it is often necessary to include call-type 
specific SL constraints (either over the whole day or for each period) in the problem formulation. 

4.2 A medium-sized call center 

In the medium-sized instances, there are K = 5 call types and / = 15 agent types. Five of the agents 
types are specialists handling a single call type, while the ten other types are generalists handling 
between 2 and 5 call types. Details on skill sets can be found in Avramidis et al. (2007b). The 
parameter g used to compute agent costs in formula 4 is now equal to 0.1. Arrival rates for all call 
types are plotted in Figure 3. 
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Figure 2: Small center: service levels by period 

As mentioned earlier, we consider two variants of this example: in M u only global and per- 
period SL constraints are enforced, while M 2 also includes disaggregate SL constraints. Since in 
practice one may find it hard to satisfy all SL constraints and since real-life call center managers are 
often more interested by global SL, it seemed interesting to compare these two variants. For each 
of them, we performed r = 8 replications for several CPU time budgets of 15, 30 and 60 minutes. 
Running CP-IP would have taken unacceptable running times for these lai^e problem instances, so 
we used CP-LP instead. The results for M\ and M 2 are summarized in Tables 4 and 5. We find 
that most replications of CP-LP return low-cost feasible solutions for both variants, but the cost 
variation between solutions is more pronounced than for the small center. The quality of solutions 
also increases significantly with w, which emphasizes the importance of performing long enough 
simulations to obtain good results. Contrary to what was observed for the small center, TS always 
finds feasible solutions, but their cost is much higher than the cost of CP solutions; in fact, the 
optimality gap with respect to the best solution found is over 25% for M { and close to that value for 
M 2 . This shows that large suboptimality gaps with TS do occur in realistic call center settings, and 
not only in artificial examples. 

An examination of the best scheduling solutions obtained for M\ with CP and TS reveals that 
CP not only uses significantly fewer agents (14 vs 17), but also that these agents have on average 
a smaller number of skills (2..50 vs 2.76), and work shorter hours (more than 10 minutes less per 
day, on average). Similar results are also observed for Nh. The best solution found by CP for M\ 
is cheaper than the one found for M 2 \ this was expected since M x is a relaxation of M 2 . In fact, 
the most interesting conclusion that one could draw here is that the increase in cost incurred when 
imposing the disaggregate SL constraints is rather marginal. 
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Figure 3: Medium-sized center: arrival rates 



Case 


Algorithm 


n 


n 2 


CPUavg 


Min 


Med 


p; 


P* 


^period 










sec. 


cost 


cost 






1 


CP-LP 


300 


100 


855 


18.33 


18.85 


0 


87 


0.51 




TS 


1200 




897 


21.83 


21.83 


0 


100 


0 


2 


CP-LP 


600 


100 


1598 


17.56 


18.43 


0 


75 


0.41 




TS 


2400 




1774 


21.72 


21.72 


0 


100 


0 


3 


CP-LP 


1000 


400 


2686 


17.36 


17.66 


25 


87 


0.02 




TS 


3000 




2793 


21.69 


21.69 


0 


100 


0 



Table 4: M\ : results obtained with CP-LP and TS for different CPU time budgets 



4.3 A larger call center 

The larger center instances have K = 20 call types and / = 35 agent types. Twenty of the agents 
types are specialists handling a single call type, while the fifteen other types are generalists handling 
between 4 and 9 call types. Details on skill sets can be found in Avramidis et al. (2007b). Agent 
costs are computed with g = 0.1. We only consider SL constraints by period, plus the global SL 
constraint. Two CPU time budgets are examined: 5 and 10 hours. We use the CP-LP version of CP 
and perform r = 8 replications. 

One of our objectives with this example is to show that the performance of CP does not depend 
much on the particular structure of the shifts. We thus consider again two variants: L 36 , which uses 
the 9-hour working day and the same shift structure as the previous examples, and L s2 , which has a 
working day starting at 8:00 AM and ending at 9:00 PM; in that variant, the total number of periods 
is 52 and all the shifts have a fixed length of 7.5 hours, thus yielding a total of 123 different shifts 
(considering also shifts starting at 1:00 PM and 1 :30 PM in order to cover the additional periods). 
Arrival rates for the I 36 variant and the 36 first periods of L 52 , follow exactly the same pattern as in 
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Table 5: My. results obtained with CP and TS varying the CPU time budget 



the medium-size example (see Figure 3), with different scalings for the different call types; for L s2 
they then decrease slowly during the last 16 periods. The results for L 36 are displayed in Table 6. 

Here, CP has difficult)' finding a feasible solution with the smallest computing budget: only two 
of the eight runs returned feasible solutions, even though constraint violations might not be severe. 
This situation is clearly alleviated by allotting more CPU time. With more time, CP also finds 
significantly better solutions. TS always finds feasible solutions, but the solutions returned are on 
average 20% more expensive than those obtained with CP. This confirms our observations of the 
previous section regarding the poor performance of TS. Surprisingly, increasing the CPU budget 
does not improve the performance of TS. In fact, with more time, it returns inferior solutions. This 
is because the method obtains a different staffing solution in the first step; while this solution might 
track more closely the call arrival curve, it ends up leading to a poorer scheduling solution. A close 
examination of the relative cost distributions of solutions highlights the fact that, in the case of TS, 
all replications produce identical, and largely suboptimal, solutions. On closer examination of the 
best scheduling solutions obtained by the two methods, we find that the CP solution is less expensive 
because it covers the demand with only 52 agents compared to 62 for TS. 
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Table 6: Z, 36 : results obtained with CP-LP and TS for different CPU time budgets 



The results for the L 52 variant are reported in Table 7. On this larger problem, just one of the 8 
solutions found by CP-LP was declared feasible by the 50,000-day simulation, but it is an excellent 
solution with a cost of 131.7. With more time, CP returned 4 feasible solutions out of 8 runs, 
but these turned out to be inferior to the one found in 5 hours, probably due to simulation noise. 
Overall, these results emphasizes the importance of performing several trial runs when using this 
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type of approach. 
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Table 7: L 52 : results obtained with CP-LP and TS for different CPU time budgets 



All the solutions returned by TS were declared feasible, but they are significantly more expensive, 
but they are again highly suboptimal, with a cost of 156.1. When we examine the best solutions 
found by CP-LP and by TS for the L 52 instance, we first remark that CP-LP uses only 96 agents 
compared to 107 for TS. We also note that 15 agents in the CP solution are specialists, while there 
are none in the TS solution. Furthermore, TS uses 32 expensive generalists with 7 skills compared 
to 12 for CP. These three factors combined explain the large difference in cost. 

Our motivation for investigating the 52-period example was to verify that CP performed correctly 
for instances with a different shift structure. Our results confirm this, but at the same time they 
highlight one of the potential shortcomings of the approach, which is that, because of simulation 
noise, when there is a laige number of constraints, one often ends up with no feasible solution, even 
though several near-feasible solutions may have been identified. We address this issue in the next 
subsection. 

4.4 Getting more feasible results 

Empirical results show mat, as problem instances become laiger and more complex, there is a defi- 
nite possibility that CP would return a set of low-cost, but only nearly-feasible solutions. While this 
may be acceptable in some practical settings, it is nonetheless annoying to be unable to provide the 
call center manager with a solution that meets all his/her requirements. A simple and attractive way 
of tackling this problem consists in slightly increasing the right-hand side value of the SL constraints 
when applying the algorithm (except obviously for the final long simulation that is used to determine 
the feasibility of solutions). It should be noted that this idea is not specific to the CP procedure and 
could therefore be applied with any other solution approach. 

We first tested this idea on the L 52 instance, using values of 0.81 and 0.82 as target SL for all 
periods. We combined these tests with experiments on the value of the threshold S used for rounding 
continuous solutions to integer ones in CP-LP. The rationale for investigating different values of 8 
is that the rounding procedure introduces a heuristic element in what would otherwise be an exact 
procedure and that selecting the best value for this threshold is far from obvious. 
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In our experiments, we considered three different values of S (0.5, 0.6 and 0.7) for CPU budgets 
of 5 and 10 hours, and we ran 8 replications in each case. All 48 runs performed for each of the 
two target values turned out to be feasible for the 50,000-day simulation! Furthermore, one of these 
runs returned a better solution than the best one we had before (cost of 130.5 vs 131.7). The results 
obtained for a SL target of 0.81 are summarized in Table 8. These results show that the value 
selected for 8 seems to have a slight, but not critical impact on the quality of the solutions obtained. 
In fact, it seems to be much more important to make sure that the runs that are made do produce 
feasible solutions, in order to have a larger set to choose from. We then ran CP-LP with the original 
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Table 8: L 52 : results obtained with a target SL of 0.81 



0.80 target value for different values of 5. These tests clearly showed that modifying S alone was 
not sufficient to consistently obtain feasible solutions, since more than half of these runs returned 
infeasible solutions. However, they did allow us to find an even cheaper feasible solution with a cost 
130.4. We also ran the algorithm with a target SL value of 0.81 for the other instances (keeping 
the value of S unchanged at 0.5). The results can be summarized as follows: 

• For the small center, all runs returned feasible solutions, which is not surprising considering 
previous results, but these solutions were no better than the ones obtained previously. 

• For the medium-sized instances, all runs produced feasible solutions, and improved solutions 
were obtained both for the Mi and Af 2 instances (with respective costs of 17.22 and 17.39). 

• For the L 36 instance, 15 of the 16 runs yielded a feasible solution, but never better than the 
best one found with an SL target of 0.8. 

Overall, slightly increasing the value of the SL target value is a useful device for making sure that 
the method will return feasible solutions. However, there is no guarantee that better solutions will 
be found by doing so. The variability of our results highlights once again the stochastic nature of the 
algorithm, which cannot be avoided considering the significant amount of noise in the simulations. 
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4.5 The impact of flexibility 

We performed another series of numerical experiments to quantify empirically the impact of the 
flexibility provided by a rich set of shift types. Those experiments were performed on the small 
center of subsection 4. 1 . We considered three sets of shift types: the original one with all 285 shifts, 
a slightly reduced one with 267 shifts, obtained by deleting the 26-period and some 36-period shifts, 
and adding some 35-period ones, and finally a much more restricted set in which we only allow the 
105 7.5-hour shifts. The staffing solutions corresponding to the best scheduling solutions obtained 
for these three cases are plotted in Figure 4, along with the optimal staffing solution computed by 
considering each period individually. Three main conclusions can be drawn from this figure: 

1. As shown by the solution with 285 shifts, if enough flexibility is introduced in the set of 
available shifts, it is possible to find schedules that track closely the staffing requirements. 

2. Even a slight decrease in flexibility (e.g., by going from 285 to 267 shifts) can lead to a 
significant overstaffing in some periods. 

3. Schedules with a relatively small number of fixed-length shifts (the 105-shift case) are bound 
to suffer from major overstaffing. 

It follows that, while the complexity of the scheduling problem significantly increases with the 
number of available shifts, there are definite benefits to be reaped from the introduction of more 
varied shift types. 
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Figure 4: Small center, staffing solutions with 105, 267 and 285 shift types 
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5 Conclusion 



We have proposed in this paper a simulation-based methodology to optimize agent scheduling over 
one day in a multiskill call center. Even though the use of common random numbers reduces the 
simulation noise (or variance) significantly, there is still a fair amount of randomness in the solution 
provided by the algorithm, mainly due to the fact that the simulation length must be kept short 
(because the estimation of each subgradient requires simulations at up to thousands of different 
parameter values). Yet, to our knowledge, better solutions are found with this approach than with 
any other method we know. In particular, during the development of the cutting plane algorithm, we 
also implemented simultaneously a metaheuristic method based on neighborhood search combined 
with queueing approximation, along the lines of Avramidis et al. (2006), but we were unable to make 
it competitive for solving the scheduling problem. 

In practice, one may run the algorithm a few times (e.g., overnight) to obtain a few solutions and 
retain the best found. We also showed that by slightly perturbing the SL targets, it is possible to 
overcome some of the problems caused by the presence of the simulation noise and thus to greatly 
increase the probability of obtaining feasible, high-quality solutions. 

Future research on this problem include the search for faster ways of estimating the subgradients 
(by simulation), refining the algorithm to further reduce the noise in the returned solution, and 
extending the technique to simultaneously optimize the scheduling and the routing of calls (via 
dynamic rules). 
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